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MEASURING PROPAGANDA EFFECTS WITH DIRECT AND 
INDIRECT ATTITUDE TESTS 


JACK A. PARRISH AND DONALD T. CAMPBELL 
The Ohio State University * 


HE present study” is in the tradition 

of the attitude-change studies which 

started in the 1930’s under the inspira- 

tion of the first efforts to measure social atti- 

tudes (e.g., 5, 9). Claims to novelty within 

this experimental approach must be restricted 
to three features: 

1. An attempt is made to study propaganda 
effectiveness through the use of disguised 
attitude measures. 

2. The relative effectiveness of two typical 
radio styles used in informational programs is 
studied: the monologue speech versus the 
dialogue or interview-type presentation. 

3. The traditional pretest design is aban- 
doned in favor of matched groups given a 
posttest only. 


METHOD 
Propaganda Materials 


In the spring,of 1948 when the data were col- 
lected, “the Kuomintang Government of China” 
seemed an appropriate topic for the purposes of an 
experiment on propaganda effectiveness. This topic 
had the advantage of controversiality and at the 
same time was remote enough that there seemed 
little likelihood of encountering unduly rigid pre- 
conceptions upon the part of the experimental sub- 
jects. Based upon a survey of writings on the 
subject, a common skeleton of facts was prepared. 
This common core was utilized in the preparation 
of four scripts, which we shall label Pro-Kuomintang 
Monologue, Pro-Kuomintang Dialogue, Anti-Kuo- 
mintang Monologue, Anti-Kuomintang Dialogue. 
All four were prepared in the format of the 
familiar “Campus Visitor” program of the Ohio 
State University ratio station, WOSU. The tran- 
scriptions were made by graduate students in 
radio speech® and were of professional quality. 
All transcriptions began “And now, WOSU pre- 


1 The second author is now af the University of 
Chicago. 

2This article draws primarily from the master’s 
thesis of the senior author (8), done under the sponsor- 
ship of the second author and of Dr. John R. Kinzer, at 
Ohio State University. The authors are indebted to the 
Ohio State Psychology Department for general coopera- 
tion and for clerical assistance which made possible 
analyses not included in the original report. 

8 The authors are indebted to Richard Mall, Phillip 
J. Mohr, and James Grissinger for assistance in this 
regard. 


sents ‘Campus Visitor,’ your weekly opportunity to 
hear the views of visiting experts on significant 
topics. Our guest for this Sunday’s program is Mr. 
Walter C. Eddington, well-known foreign corre- 
spondent . . . for many years an authority on far- 
Eastern affairs. During the recent war he served in 
China as correspondent of the New York Herald 
Tribune . . .” etc. For the monologue presenta- 
tions Mr. Eddington, once intoduced, gave his com- 
ments without interruption. For the dialogue pres- 
entations, the same material was covered through an 
interchange of questions and answers between the 
announcer and Mr. Eddington. The pro and the 
anti scripts covered the same facts but differed in 
the interpretation made of them. The anti-Kuomin- 
tang scripts attacked the Chiang Kai-shek govern- 
ment but were also anti-Communist and supported 
the U. S. policy. The pro-Kuomintang scripts inev- 
itably made use of the argument of the Communist 
menace in China. All scripts were approximately 
12 minutes in length. 


Test Materials 


Three types of attitude tests were designed. 
The first two were intended as indirect indicators 
of attitudes, following a line of reasoning presented 
elsewhere (1). These were labeled “Test of Your 
Information on the Situation in China” and “Judg- 
ment of Public Opinion.” On the third test the 
respondents were asked to “Indicate your own per- 
sonal attitudes about China.” Examples of each of 
these three types of items are the following: 

Test I. 

2. Chiang Kai-shek and the Kuomintang gained 
control of China in 1927 by what means: (@) popu- 
lar election, (6) representative election, (¢) replace- 
ment of a deceased leader, (d) political party split, 
(¢) armed revolution. 

18. How many parties are authorized by the 
Kuomintang government of China? (a) no limit, 
(4) four, (¢) three, (d@) two, (¢) one. 

Test II. 

36. In April, 1948, the Gallup Poll asked the 
American public the following question: “Do you 
approve or disapprove of the U. S. giving the 
Chiang Kai-shek (Nationalist) government more 
military supplies, goods, and money?” What per- 
cent answered “approve”? (a) 13%, (6) 28%, (c) 
40%» (d) 56% Ce) 71% 

40. In April, 1947, 120 foreign correspondents 
who had served in China during the war were 
asked: “Do you approve or disapprove of the 
present Kuomintang cabinet?” What percent an- 
swered “disapprove”? (@) 32%, (6) 43% (c) 56%, 
(d) 71% (e) 82%, 
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Test Ill. 

46. The U. S. should help the Chinese Kuomin- 
tang (Nationalist) government destroy the Chinese 
Communists: (a) strongly agree, (6) agree, (c) 
neither agree nor disagree, (d) disagree, (¢) strongly 
disagree. 

47. I think that inflation in China is a direct result 
of corruption within the Kuomintang (Nationalist) 
government: (a) strongly disagree, (4) disagree, (c) 
neither agree nor disagree, (d) agree, (¢) strongly 
agree. 

In Tests I and II an effort was made to select item 
alternatives which could be graded according to 
their favorableness to the Kuomintang government. 
For Test II, the underlying hypothesis was that 
the larger the majority an individual perceives 
to be in favor of a given proposition, the more 
likely he is to agree with the proposition. Or, stated 
in a different manner, people tend to overesti- 
mate the proportion of individuals who hold the 
same attitudes to which they themselves subscribe. 
Evidence in support of this hypothesis is cited 
elsewhere (1). In the scoring of all three tests 
a “Likert-type” weighting of alternatives has been 
used. Thus the most strongly anti-Kuomintang alter 
native to each question received a weight of 5 
(this would be alternatives ¢, ¢, a, ¢, ¢, and ¢ respec- 
tively in the above six examples). The pro-Kuomin- 
tang extreme received a weight of 1. For the scor- 
ings reported in the present article, the correctness 
of the answers to Tests I and II has been disregarded 
and the correct answer given a weight in line with 
its position in relation to the other alternatives. 
This can be justified by two reasons. First, it was 
assumed that for the present population, most of the 
items were sufficiently difficult that few if any 
persons would have the detailed information at their 
command, enabling them to select “right answers.” 
Second, selective knowledge of information which 
tends to support a given argument is, in itself, 
diagnostic of attitude (7). All scoring was based 
upon this a priori weighting of the items. There 
were 32 scored items in Test I, ro in Test II, and 
20 in Test III. 


Experimental Design 


The traditional pretest and posttest, experimen- 
tal and control groups design has one weakness 
which has been generally overlooked. Strictly speak- 
ing, as a logical model it provides a basis of gen- 
eralization only to other pretested populations. This 
limitation, trivial in many experiments, may be 
expected to be important in attitude change studies. 
The pretest must inevitably create a set or focus 
attention on the topic under study, making the 
pretested audience no longer typical of audiences 
in general. It likewise will in many situations alert 
the audience to the intentions of the experimenter 
and propaganda. For those researchers interested 
in generalizing to the un-pretested audience, it 
would seem well to avoid the pretest and posttest 
design and to substitute for it a design in which 
equivalent groups receive a single test, preceded for 
the experimental group by exposure to the experi- 


mental variable. Schanck and Goodman (10) have 
been the first to express this point of view, and it is 
also implicit in Solomon's paper (11) on the exten- 
sion of control group design, although he does not fol- 
low it through to its logical conclusion. Hovland (4) 
has an excellent discussion of the pros and cons, and 
he favors retaining the pretest model where one 
wishes to study individual correlates of amount 
of change and where there is difficulty in selecting 
experimental and control groups randomly from the 
same population. With these limitations we would 
agree, but we would still emphasize that for the 
purposes of most propaganda effectiveness studies, 
the “posttest only” design is more appropriate, pro- 
viding the greater ability to generalize to the 
nonexperimental situation. 


Subjects 


Ten classes in elementary psychology served as the 
subjects in this experiment. Propaganda exposure 
and testing were completed in one class period. 
Eight of the classes listened to one or another of 
the four transcriptions; each of the four transcrip- 
tions was heard by two classes. The remaining 
two classes took the tests without any propaganda 
exposure and will be called the Control Group in 
the following discussion. As far as is known or as 
can be inferred from the data, there were no selec- 
tive factors involved in the occurrence of a student 
in one class or another. Assignment ‘of classes 
within the experimental design was randomized. 
All students took the tests anonymously. As far as 
could be discovered from later class discussions, all 
accepted the recordings as bonafide transcriptions 
prepared for station use. The students were in- 
formed later of the biased nature of the presenta- 
tions. No suspicions as to the intent of Tests I 
and II were expressed. To facilitate the use of 
analysis of variance, an equal number of cases for 
each of the five groups was achieved by a random 
discarding of excess cases. For the data analyzed, 
there were 47 cases in each of the five groups. 


REsutts 
Characteristics of the Tests 


Reliability of the tests was computed by 
a modification of the Kuder-Richardson 
Formula No. 14 (6) to make it appropriate 
for items with more than two levels of en- 
dorsement.* Intercorrelations among the 
three tests were also computed. Because of 
the possibility that exposure to the experi- 
mental conditions might modify these values, 


4 The authors are indebted to R. J. Wherry and R. H. 
Gaylord for the formula used. The modification involves 
essentially the substitution of item variances for the 
fq terms of the original formula. (Note that the Kuder- 
Richardson and equivalent reliability formulas under- 
estimate rather than overestimate reliability if the 
essential assumptions, including single-factoredness, are 
not met.) 
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TABLE | 


CHARACTERISTICS OF TEsTs 








Script 


RELIABILITY OF TEST 


INTERCORRELATION OF TESTS 








Il 


I vs. Il I vs. Ill II vs. Il 





Pro-Kuomintang Monologue 46 
Pro-Kuomintang Dialogue -48 
Anti-Kuomintang Monologue . -53 
Anti-Kuomintang Dialogue +35 
Control - 66 
Average . .50 
Average, corrected for attenuation 


.28 -35 
+13 +23 +25 
-28 -42 -36 
-28 i +22 
-03 28 -56 
+20 -36 -35 
-47 51 +53 


+34 





we have computed them separately for each 
of the five experimental groups. These values 
are presented in Table 1. The alternative 
procedure of computing reliability and inter- 
correlations on a pooled population would 
give exaggerated values where the experi- 
mental conditions have systematic effects on 
mean values. 

In interpreting Table 1 it may be noted 
that the reliability values for the direct test 
(III) are consistently high, as has in general 
been found for attitude tests involving direct 
voluntary self-description of attitudes. The 
reliability coefficients for the two indirect 
tests are low and show considerable fluctu- 
ation from group to group. This too is in 
line with previous experience (1). The 
values for the indirect tests are sufficiently 
high and consistent, however, to demonstrate 
some syndrome of response consistency 
among the responses to the items. The 
present authors feel that in revised tests, 
utilizing the experience of constructing and 
analyzing the present ones, higher reliabilities 
could be obtained. It is doubtful, however, 
that indirect tests can ever achieve the inter- 
nal consistency of direct ones. In the latter, 
the respondent is aware that he is portraying 


his own attitudes and may possibly exag- 
gerate their coherence. 

The intercorrelations among the three tests 
indicate that to some extent they are tapping 
a common syndrome of response tendencies, 
which we choose to interpret as “Attitude 
toward the Kuomintang Government of 
China.” This syndrome, of course, is prob- 
ably a part of a larger syndrome of attitudes 
which we have not explored. The corrections 
for attenuation in Table 1 estimate what the 
correlations would be if the tests were per- 
fectly reliable. If all three measured but a 
common factor, these would approach unity. 
It appears that, while about half of the con- 
sistent variance of each test is shared with 
the other tests (12), each is measuring some- 
thing specific to itself, also. The two indirect 
tests have no more in common with each 
other than does each with the direct test. 


Effects of the Experimental Variables 


The data were first analyzed using a two- 
way analysis of variance, involving four cells 
and not utilizing data from the control group. 
A separate analysis was made for each of the 
three tests. The results of these analyses are 
shown in Table 2. 


TABLE 2 


Variance ANALYsiIs OF EXPERIMENTAL EFFECTS 








I. INFoRMATION TEST 


CONTENT OF pe 
DIALocuE 


Script 





MONOLOGUE 


II. Optnton Estimates It]. Direct Test 





MonoLocuz DiuALocuve MonoLocuz DitaALocusg 





94.68 
110.08 


95.81 
107-94 


Pro-Kuomintang 
Antu-Kuomintang 
F ratio 
Pro-Anti 
Mono.-Dia. 
Interacuon I 


75-75 
-10 


.07 


28.72 
32-53 


27.00 
32-74 


52.72 
68.26 


53-44 
68.58 


31.78 
1.20 
-99 
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For all of these F ratios, a value of 3.89 is 
required for significance at the .05 level, 6.81 
for significance at the .o1 level; values above 
11.00 would occur by chance fewer than one 
time in one thousand. The values in the 
cells are means, provided to indicate the direc- 
tion of effects. Each cell mean is based upon 
47 cases. High scores are anti-Kuomintang. 

From inspection of the cell means in the 
table, as well as from inspection of the F 
ratios, it is apparent that the pro-anti slant of 
the propaganda made an important difference 
on all of the attitude measures. In all cases, 
the effect is significant beyond the .oor level. 
On the other hand, the form of presenta- 
tion—monologue versus dialogue—has made 
no appreciable difference on any of the three 
tests. (Note that the anticipated greater 
effectiveness of the dialogue method of pres- 
entation would have appeared as an inter- 
action effect in the above statistical analysis, 
making the pro cell lower and the anti cell 
higher in the dialogue sections than for the 
corresponding monologue cells.) This find- 
ing, while clear-cut in the statistical data, 
goes against the subjective judgments of the 
experimenters and colleagues who listened to 
2ll four transcriptions. For such persons, 
the dialogue présentation was universally 
favored. Our data did make possible one 
further comparison between the two styles. 
As one of the final assignments, the listeners 
were asked to express their opinions as to 
what proportion of the transcriptions were 
“propaganda.” For both the pro and the 
anti materials, there was a tendency for the 
monologue presentation to be seen as more 
propagandistic. A chi-square test applied to 
a fourfold condensation of the data on this 
item gave a value of 10.69, significant well 
beyond the .o1 level, and almost reaching the 
oot level. This difference is in marked con- 
trast to the quite convincing equal effective- 
ness of the two modes of presentation, as 
judged by the attitude tests. It is quite pos- 
sible that propaganda may be effective even 
when recognized by the audience as such (2). 
It should also be remembered that the ques- 
tion as to propagandistic content was asked 
after both transcription and attitude tests 
had been experienced and was thus a retro- 
spective judgment, from which it is hazard- 
ous to guess at the audience evaluation while 


listening. It must also be remembered that 
attitude-change processes are extended in 
time, with something like “consolidation” 
taking place, and that in such processes more 
general affective aspects of the situation may 
be important. One of the limitations of the 
present experiment is that the effects of the 
propaganda are measured immediately after 
exposure, and generalizations to more long- 
term effects are unwarranted from these 
data (4). In particular, the present authors 
would feel that if the monologue presentation 
is more often seen as propagandistic, it is 
likely that it will be less effective under some 
conditions. 

To introduce the control group into the 
analysis for a study of the relative effective- 
ness of the pro and anti transcriptions, a 
second analysis of variance was performed. 
This analysis treated all five experimental 
groups as categories within a single classifi- 
cation criterion. For comparison with the 
means presented in Table 2, the mean cell 
values for the control group on the three tests 
are: Test I, 98.77; Test II, 28.81; Test III, 
55-13. As would be expected from the initial 
plan of the experiment, these values are inter- 
mediate between the pro and anti groups. 
Using the error term derived from the one- 
way analysis of variance, the propaganda 
exposure groups were compared with the con- 
trol group. The ¢ ratios resulting from this 
comparison are shown in Table 3. 

The most impressive finding from Table 3 
is that the anti-Kuomintang propaganda is 
strongly effective, while the pro-Kuomintang 
propaganda has had little if any effect. In 
Table 3, a one-tailed test of significance has 
been used, inasmuch as the direction of the 
difference anticipated is clearly stated in the 
experimental design. By this token, all but 
one of the tests involving the anti-Kuomin- 
tang propaganda are significant far beyond 
the .oor level, and the one is close to it. 
There is no doubt as to the efficacy of these 
“anti” transcriptions. For the pro-Kuomin- 
tang propaganda, the case is much more 
ambiguous. In favor of a significant differ- 
ence is the general observation that all differ- 
ences are in the expected direction. However, 
only one of the six testings reaches signifi- 
cance at the .o5 level or better. But as the 
Pro-Kuomintang Monologue and the Pro- 
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TABLE 3 


Dirrerences Between PropacaNnpa Groups AND Controt Group 








I. INrorMATION TEST II. Opinion Estimates Ill. Direct Test 


t ? t Pp : P 











Script 





—1.00° -16 
—.70* +24 
5-44 <.001 
5.58 <.0o1 


—.08* -47 
—1.52° .06 
3.11 <.001 
3.06 <.002 


—1.77° -04 
—1.24° -II 
4-91 <.001 
3.98 <.001 


Pro-Kuomintang Monologue 
Pro-Kuomintang Dialogue 
Anu-Kuomintang Monologue 
Anti-Kuomintang Dialogue 





* Negative signs indicate that the mean of the group involved was lower than that of the control group. 
As anticipated in the design of the experiment, these negative values occur only for the Pro-Kuomintang propa- 


ganda groups. 


Kuomintang Dialogue are independent 
samples, it seems justified to multiply proba- 
bilities in ascertaining a joint likelihood of 
obtaining two such samples of the same size 
with differences of like direction and magni- 
tude. Doing this makes the results on the 
Information Test for both pro-Kuomintang 
propaganda groups combined significant at 
the .004 level. For the Opinion Estimate 
Test, the combined probability is .o3. For 
the Direct Test, the value is .o4. We would 
interpret this as signifying that as measured 
on the Information Test at least, and prob- 
ably for the other two tests, the pro-Kuomin- 
tang propaganda had a small but significant 
effect. 

The relatively greater effectiveness of the 
anti-Kuomintang transcriptions deserves 
some comment, particularly in light of the 
many shifts in public attitudes that have 
occurred since this experiment. For political 
scientists or others who would wish to study 
the scripts further, they are presented in the 
senior author’s thesis (8). We do not believe 
that the difference can be explained in terms 
of any greater enthusiasm on the part of 
either writer or speaker for one point of view 
as opposed to the other. As nearly as can be 


told, both points of view were presented 
authoritatively and persuasively. On the item 
discussed above, asking what proportion of 
the presentation was thought to be deliberate 
propaganda, no differences were found be- 
tween the pro and anti scripts. The odds are, 
in fact, a little in favor of the obtained scat- 
ter being a random distribution (p=.57). 
What little difference exists is in the direction 
of the pro-Kuomintang transcriptions being 
seen as more propagandistic. As the final 
task, the students were asked to choose 
among five alternatives in completing the sen- 
tence “The propaganda, if existing, repre- 
sents the point of view taken by... .” The 
results for the four propaganda exposure 
groups are shown in Table 4. 

While some confusion exists, the responses 
in Table 4 seem clearly to differentiate the 
two programs. ‘The identification of the 
anti-Kuomintang scripts with the Chinese 
Communist point of view is perhaps under- 
standable in terms of a general tendency for 
people to use “two-valued” logic or to think 
only in terms of polar opposites when view- 
ing the political scene. No such identifica- 
tion was intended in the composition of the 
script. In fact, the opening of the argument 


TABLE 4 


Guessep Potnt of View oF TRANSCRIPTIOY: 








Pro-KUOMINTANG 


Guessep Point or View MonoLocuE 


Pro-KvOMINTANG 
DIALocue 


ANnTI-KUOMINTANG 
Mono.Locur 


AnTI-KUOMINTANG 
D:aLocug 





. U. S. State Department 16 
. Kuomintang Government 20 
. Chinese Communists 

. U. S. Army 

. British Foreign Office 

. No response 





16 
19 
2 


I 
I 
8 
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of the anti-Kuomintang scripts claims that 
the Kuomintang government, through in- 
competence and corruption, is letting China 
fall into the Russian sphere of influence. 
Nor can we believe that there is any causal 
relationship between the greater effectiveness 
of the anti-Kuomintang transcriptions and 
the tendency to identify them as representing 
the Chinese Communist point of view; for 
the audience involved, it is judged that any 
such identification would have reduced the 
persuasiveness of the argument. One pos- 
sible line of explanation lies in terms of 
adaptation level (3) or satiation theory. 
Without making a detailed analysis, it is 
judged that the general line of the press at 
the time of the experiment and immediately 
before, upheld something like the pro-Kuo- 
mintang point of view. (In the time subse- 
quent, the press has seemed to shift first 
toward a more anti-Kuomintang position, 
corresponding to the worsening military 
situation of the Kuomintang forces and 
General Bradley’s report. More recently it 
has returned to a pro-Kuomintang orien- 
tation.) In Helson’s (3) terms, through 


contact with press and radio, the level of 


adaptation of our students for mass media 
expressions of attitude toward the Kuomin- 
tang government was toward the pro- 
Kuomintang side; the pro-Kuomintang 
transcriptions thus tended to be “neutral” 
and their effective stimulus value slight. To 
put it in other terms, while the attitudes 
of students were not wholly pro-Kuomin- 
tang, they had already been moved as far 
in a pro-Kuomintang direction as casual 
contact with mass media products was likely 
to move them. A “law of diminishing 
returns” statement would be equivalent: one 
additional pro-Kuomintang, anti-Communist 
statement added to a long series of such items 
could have little effect. At the time of the 
experiment, then, the anti-Kuomintang tran- 
scriptions offered more novelty, contrast, 
effective stimulus. There was more “room” 
for the students’ attitudes to be moved in an 
anti-Kuomintang direction. Such an inter- 
pretation, however, will have to remain 
conjecture insofar as the data available in the 
present experiment are concerned.° 


5 Data were available on one further point, of in- 
terest with regard to the general problem of attitude 


In an experiment such as this, the question 
is often asked: “You have shown that propa- 
ganda made a difference, but how much of 
a difference did it make?” This is a question 
which the ordinary statistics used in attitude 
change experiments cannot answer. What is 
needed is a means of changing the statistics 
of significance into statements of degree. 
Upon consultation with Robert J. Wherry, 
the point-biserial coefficient of correlation was 
selected for this purpose. Using the data 
from the analysis of variance, the correlation 
was computed between attitude test scores (a 
continuous variable) and type of propaganda 
exposure (with only two points, pro-Kuo- 
mintang and anti-Kuomintang transcrip- 
tions). (For this purpose, monologue and 
dialogue groups have been combined and the 
control group disregarded.) The point- 
biserial coefficient is appropriate when the 
data can be classified into two groups, but 
no inferences as to the difference between the 
two groups can be made, nor can they be 
regarded as two halves of a continuum. The 
values tend to run lower than an ordinary 
biserial correlation computed on the same 
data. This computation gave us for Test I, 
the Information Test, .54; for Test II, Esti- 





diagnosis, and of some relevance to the preceding dis- 
cussion. In a previous paper (1) it has been suggested 
that indirect indications of individual attitudes might 
be ascertained by asking persons to judge the relative 
bias or propaganda content of various argumentative 
passages. While an ideal test of this possibility in the 
present context would have required pre-propaganda 
measures of attitude, a correlation of attitude scores with 
judgments, as to the predominance of propaganda con- 
tent, would still seem of interest. Item 66 asked, “What 
percentage of the program, if any, contained deliberate 
propaganda?” Five alternatives were offered, ranging 
from 10 to 75 per cent. The responses to this item corre- 
lated .24 with the direct test attitude scores among those 
exposed to pro-Kuomintang transcriptions. This means 
that there was a slight tendency for those who were 
more anti-Kuomintang to see the transcription as more 
propagandistic. Among those who heard the anti- 
Kuomintang transcriptions, the value was —.13. As 
expected, the direction is reversed, and a slight tendency 
is found for the more anti-Kuomintang persons to see 
the transcriptions as less propagandistic. (The first of 
these values reaches the .05 level of significance; the 
second does not.) Item 67 continued by asking, “What 
was the percentage of unintentional or unconscious 
propaganda in the program, if any?” and offered the 
same range of alternatives. Combining the responses to 
Items 66 and 67 into a total, correlations of .25 and 
—.19 were obtained. (Both reached the .05 level of 
significance but not the .or level.) While, at a low 
level of significance, the general expectation is con- 
firmed, the strength of the relationship is so slight as 
to offer little promise of diagnostic usefulness. 
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mates of Group Opinion, .38; for Test III, 
the Direct Attitude Test, .55. These values 
are impressively high. As one might expect 
for a relatively unsalient attitude, the experi- 
mental condition has made a considerable 
contribution to the range of observed atti- 
tudes. For the Direct and Information Tests, 
roughly one-fourth of the variability of the 
pool of test scores can be interpreted as being 
contributed by the factor of propaganda 
type (12). 

Asking the question of degree rather than 
significance of effect can lead us also to the 
problem of which test was most sensitive to, 
or most influenced by, the propaganda. The 
three values given above indicate something 
of this, showing the direct test to be slightly 
more sensitive than the indirect information 
test. However, this may have been due to 
the much greater reliability of the direct test 
rather than to its greater appropriateness to 
the situation. What would the relative sensi- 
tivity of the tests have been had all been 
equally reliable? For answering this ques- 
tion, the traditional correction for attenuation 
seems appropriate. Using the average relia- 
bility values from Table 1, we obtain the 
following propaganda-test correlations cor- 
rected for the unreliability of the tests: Test I, 
.71; Test II, .54; Test III, .59. In these terms 
the Information Test seems most fully to 
reflect the propaganda exposure, with propa- 
ganda exposure accounting for roughly one- 
half of the reliable variance in test scores, 


SUMMARY 


Dummy radio transcriptions, both for and 
against the Kuomintang government, were 
presented in both monologue and dialogue 
form to four experimental groups of students. 
These groups, and an additional group hear- 
ing no transcription, were given two indirect 
attitude tests (one in an information-test 
format and one asking for estimates of group 
opinion) and one direct attitude test dealing 
with attitudes toward the Kuomintang gov- 


ernment. As measured by all three instru- 
ments, the anti-Kuomintang transcriptions 
were highly effective, the pro-Kuomintang 
ones marginally effective. The monologue- 
dialogue difference in presentation of ma- 
terials yielded no effect at all on attitude 
scores. The dialogue presentations were, 
however, less frequently judged to be propa- 
gandistic than were the monologues. While 
the three tests share significant variance in 
common, they also are measuring some non- 
overlapping components. The direct test is 
distinctly more reliable than the indirect tests. 
When these reliability differences are com- 
pensated for, the information test is judged 
to reflect most sensitively the propaganda. 
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NE of Freud’s early contributions (4) 
() was to demonstrate that the middle- 

class subsociety in Western culture 
suffers from the repression of sexual drives. 
Later he extended his analysis to include a 
variety of motives (5, 6). Now, when the 
body of Freudian theory is incorporated into 
social psychiatry, the following important 
question arises: Is there a relation between 
culture and the drives most likely to be 
inhibited ? ? 


‘THEORETICAL CONSIDERATIONS 


Perhaps the simplest approach to the prob- 
lem is in terms of the concept of motive. 
We may say that every individual adjusts 
by gratifying his motives, some of which are 
inherited, others acquired. Because of the 
plasticity of the human nervous system, man 
learns most of his adjustive behavior from 
the culture in which he participates. But 
no culture provides sadequate gratification 
for all the motives of its participants. ‘The 
primary hypothesis of this paper is that cul- 
tures vary in the areas of their inadequacy, 
i.¢., in the motives which are not adequately 
gratified by the culture. Evidence supporting 
this proposition is the relation found between 
social ecology and psychoses in our own 
society (3). 

Let us begin with inherited motives. As 
has been stated, Freud showed that the white 
middle-class subculture produces an inade- 
quate satisfaction of sexual motives. How- 
ever, other societies may have different 
motives which are inadequately gratified, 
such as hunger. Holmberg (13) has sug- 
gested that this is true of the Siriono. And 
in an incomplete study made in 1941 on 
lower-class Negro psychotics in St. Louis, 
I concluded that the basic factor in their 

1This is a summary of a research project carried 
out under a grant from the Social Science Research Com- 
mittee of the University of Chicago. 

2“Inhibition” is used here as a general category to 


cover both conscious suppression and unconscious re- 
pression, 


personality disorganization was economic 
deprivation. 

It may also be argued that some cultures 
prevent the adequate gratification of acquired 
motives. In this paper I will limit myself 
to one of these, namely, the inadequate ex- 
pression of social opposition. This has al- 
ready been noted by Hallowell (7, 8, 9, 10, 
11, 12) and Landes (16, 17) for the Ojibwa,’ 
and Eggan (2) for the Hopi; they have shown 
that these Indian cultures inhibit aggression. 
I would like to pursue this point further, 
with reference to the Menomini. 

In another work (19) I have discussed har- 
monious (cooperation and collaboration) and 
opposed (competition and conflict) types of 
social interaction. Both exist in every group. 
But intragroup opposition tends to disrupt 
the solidarity of the group; therefore, it is 
controlled in a persisting group. It is par- 
tially inhibited and partially expressed in cus- 
tomary channels. The two commonest forms 
of such expressions are opposition toward out- 
groups, as in warfare; and periodic license 
within the group, as in the Roman Carnival 
described by Goethe. However, in every 
culture the individual depends upon some 
primary groups—particularly the family of 
orientation—for most of his fundamental ad- 
justments. The solidarity of such groups has 
such high social value that very little, if any, 
opposition is permitted within them. In our 
own complex society the family organization 
is nuclear and independent, and there are 
relatively féw other fundamental primary 
groups, so that social opposition is expressed 
—and, in fact, given high social value—in 
many relationships of a secondary nature. 
The situation is quite different in simple soci- 
eties which conceive of themselves as con- 
stituting a single fundamental primary group, 
for in such cases minimal expression of social 


81 knew of the papers on a related people, by Hallo- 
well and Landes, but deliberately left them unread 
until I had finished my paper, in order to have an 
independent check on my findings. The similarity in our 
observations is encouraging. 
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opposition is permitted. This seems to be 
true of the Menomini. 

I will go a step further, and maintain that 
no culture provides for the adequate ex- 
pression of such opposition. Indeed, I would 
suggest that the inhibition of opposition in 
basic primary groups is more general than 
inhibition in the classic sexual form. Even 
our own culture produces much repressed 
opposition towards parents and siblings; it 
is, I submit, a fundamental factor in most 
cases of neuroses and psychoses found in our 
own culture. If this is so, the sexual form 
often taken by such repressed opposition, as 
in the Oedipus and Electra complexes, is 
simply due to displacement. 

The phenomenon of intragroup opposition 
can be studied more easily in simpler societies. 
Therefore let us turn to the Menomini.* 


Tre Mores oF THE MENOMINI 


The life goal of the Menomini is to live 
well® and quietly. This is achieved, in the 
last analysis, by treating others as if they 
were members of one’s own family.® People 
should cooperate with each other and help 
each other; they should not compete or en- 


gage in verbal or physical conflicts. 

In order to avoid occasions for offense, 
people are supposed to be circumspect in 
their behavior. Consequently there is little 


spontaneity in social interaction. People 

* Many of the observations given in this study have 
been described and interpreted for other American 
Indian tribes. For instance, Linton (18) has con- 
sidered some of them as due to the disorganization 
resulting from acculturation; Kluckhohn (15) has treated 
them functionally; and Barnouw (1), historically. I am 
not questioning the validity of these approaches; the 
purpose of this paper is simply to suggest an additional 
point of view. 

5 This includes long life, good health, enough food 
to eat, and new clothes. 

® My fieldwork, 1949-1951, was done in Zoar, the 
conservative community on the Menomini Reservation, 
Wisconsin. Since the group is small (consisting of 168 
people during the summer of 1950), and the kinship 
terms are predominantly classificatory, almost everyone 
is believed to be related to everyone else. Therefore 
it is not surprising that the people are supposed to 
treat each other as members of a single family. 

There are two dominant religious cults in Zoar: the 
Dancing Rite (known to anthropologists as the “Dream 
Dance,” and called the “Powwow” by English speaking 
Menomini), introduced in 1879; and Peyotism, adopted 
in 1914. The members of both groups believe that 
the ethic here described traditionally applied only within 
the tribe, and that warfare between tribes was common; 
but with the introduction of their own cult, this ethic 
was extended intertribally as well. 


should be undemonstrative, both in their 
positive and negative emotions. They are 
expected to reflect before talking, and then 
to speak sparingly and in a subdued manner. 
Anyone who speaks hastily, loudly, and at 
length is considered to be behaving like a 
child. 

Competition is frowned upon. All members 
of the community a-e supposed to be on a 
par; no one should be thought of as better 
or worse than any other. To become out- 
standing means that others will be jealous; 
this makes one the object of criticism and 
witchcraft. 

The people are expected to refrain from 
verbal conflict. There should be no quarrel- 
ing, criticism, or abuse * of a listener. Ideally, 
one should not even criticize or gossip about 
an absent person. This prohibition of verbal 
conflict extends to not answering such re- 
marks by another, and the avoidance of situ- 
ations in which such opposition occurs. Many 
sanctions are used against those who engage 
in verbal conflict: disapproval, unpopularity, 
loss of status, witchcraft, and supernatural 
punishment by spirits both in this life and 
after death. 

Physical conflict is similarly prohibited. 
One should not start a fight. If someone 
else does, the proper thing to do is to turn the 
other cheek and walk away. The sanctions 
applied against physical conflict are the same 
as those against verbal conflict. 

One must be particularly circumspect 
towards the elders. In the first place, the 
political organization of the community is 
gerontocratic. Second, the elders are respected 
because of their superior wisdom due to their 
greater experience. Last, and most impor- 
tant, they are the experts in the use of medi- 
cines, both good and bad, and when offended 
by others may be expected to retaliate by 
injuring or killing the offender through 
witchcraft. 


Customary Forms of Expression of Social 
Opposition 

It seems to me that no persisting culture 

can completely inhibit the expression of 

fundamental drives; the resulting maladjust- 

ment would be too disastrous to the society. 

7 There seem to be no contemporary Menomini swear 


words. I never heard anyone in Zoar swear, cither 
in Menomini or English. 
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Consequently, cultures provide customary 
forms for the expression of otherwise in- 
hibited motives by means of channelized 
kinds of direct gratification, sublimation, and 
violation. 

There are a few channelized forms of 
direct opposition which are acceptable to the 
Menomini, namely, teasing and competitive 
games. In the past there presumably were 
definite joking relations between kin, but 
teasing is now general. Very little of it 
strikes the observer as good-natured; most 
teasing is half in jest and whole in earnest, 
with nasty digs the rule. The importance 
of teasing as an acceptable form of expressing 
social opposition is indicated by its prevalence 
and the elaborate terminology used to describe 
it. Teasing is given two justifications: it is 
defended on the ground that one is only 
joking, or as the politest way of expressing 
disapproval. 

Some competitive gamies are played. Of 
the traditional ones, only the moccasin (sim- 
ilar to the white man’s shell game) and 
lacrosse games seem to persist. The former 
is played occasionally; no instance of it oc- 
curred during my stay. The latter is played 
ritually; there were two games during the 
summer of 1950, the only ones which occurred 
after a lapse of some years. Characteristically, 
members of the losing side afterwards mut- 
tered that the winners had cheated, and made 
snide remarks about the outstanding players 
on both sides. As to white games, the chil- 
dren play baseball, which they learned at 
school; the men play poker. However, it is 
considered immoral to play these white 
games too often; inordinate play leads to 
punishment by spirits. 

I found two major forms of customary 
sublimation. One was the popularity of folk- 
lore in expressing hostility. Every elder who 
taught me Menomuini traditions somehow 
made sure that he or she told me the flood 
story. In each instance the greatest elabora- 
tion and most satisfaction were evinced at 
the point wheze all mankind is destroyed 
except the few members of the single family 
who heeded the warnings of the Great 
Spirit. All the other narratives volunteered 
dealt with characters who kill their opponents 
by trickery. 

Another form of customary sublimation is 


that providing relatively subtle expressions 
of hostility toward the white man. There are 
innumerable legends of the ways in which 
the Menomini have been exploited by whites, 
and they are told with resentment. The 
people have adopted some nativistic move- 
ments based on prospective myths involving 
the destruction of all white men; these are 
kept a secret from whites. Finally, even the 
exoteric customs having high social value, 
including the rites, are not to be told to white 
men; revealing such information is punished 
by the spirits with death. 

There seem to be three forms of customary 
violation: witchcraft, drinking, and gossip. 

All agree that witchcraft is now not as 
common as it was a few decades ago; this is 
probably due to acculturation. But everyone 
is afraid of “bad medicines,” and anyone who 
wants to inflict harm on another can either 
use these medicines himself if he owns them, 
or pay someone who does. The use of bad 
medicines is immoral. People owning them 
rarely acknowledge the fact publicly, and 
those who go to such people keep it a secret. 
The people believed to be witches are invari- 
ably older men and women who express a 
noticeable amount of social opposition. This 
belief is a means of expressing community 
disapproval of their behavior, and as such 
acts as a social control. 

Alcohol releases inhibitions with a venge- 
ance, as far as these people are concerned (14). 
Drinking usually begins Friday night and 
lasts through Sunday; living in the com- 
munity as I did, weekends became a night- 
mare to me. The people become individu- 
alistic and irresponsible, boisterous, boastful, 
quarrelsome, and bellicose. Afterwards there 
are few pangs of conscience.* At first I was 
struck by the fact that one or two drinks 
seemed enough to make them drunk. But 
after watching more carefully and hearing 
those who had been aggressive under the 
influence of liquor later narrate in detail what 
had occurred when they were supposedly 
so drunk as not to know what they were 
doing, I concluded that they often were not 
as drunk as they pretended to be. They 
seemed to be using the occasion as an excuse 
to express opposition which is customarily 
inhibited. Later I found that the people 


8 This ties in with the absence of manic-depression. 
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themselves often remarked to the same effect. 

The community abounds in vicious gossip. 
Some peddle it more subtly than others, but 
all constantly indulge in it. Both sexes be- 
lieve that gossip is common among women 
and absent among men. According to my 
observations, it occurs in both sexes, but with 
differences in degree and form. Gossip of 
all kinds is more prevalent and more frankly 
such among women—for what reason I have 
not been able to determine. The men in- 
variably preface derogatory remarks with the 
equivalent of “I don’t like to say anything 
against anyone, but... .” Gossips ration- 
alize their behavior. In the case of unfavor- 
able incidents, they consider that they are 
obliged to tell the whole truth. When one 
happens to say something favorable about 
anyone but close kin, the narrator usually 
drags in some unfavorable particulars as well 
in order to show objectivity. When telling 
an unfavorable incident, the facts become an 
outline which is embroidered with whatever 
fantasies are suggested by the gossip’s hos- 
tility. In the case of derogatory rumors felt 
to be probably baseless (I never heard com- 
plimentary rumors), the gossip relieves him- 
self or herself of responsibility by identifying 
the source, sometimes giving the name of the 
person from whom the rumor was received, 
at other times being vague. The rumor is 
then told ostensibly to elicit more definite 
information from the listener. 


Symptoms of Cultural Inadequacy 


In order to establish that the customary 
forms of expressing social opposition are in- 
adequate, symptomis of such inadequacy have 
to be established. In another work® I have 
discussed cultural inadequacy, and arrived at 
the following indicators: preoccupation with 
the problem situation, questioning of custom, 
social unrest, increased nonconformity, break- 
down of social controls, social disorganization, 
and personality disorganization. 

Because of repression, the inadequate ex- 
pression of social opposition is not itself per- 
ceived as a problem. Instead, there is a 
preoccupation with two symptoms, drinking 
and gossip. Exhortations to children by par- 
ents and grandparents continually emphasize 


Social and cultural change. Unpub- 


® Scorkin, J. S. 
lished manuscript. 


that one should neither drink nor gossip, and 
these also form the invariable themes of the 
preaching to adults at the various rites. When- 
ever I inquired about community problems, 
drinking and gossip were volunteered as the 
most serious internal problems. 

I never heard anyone question the morality 
of inhibiting social opposition, not even in 
answer to leading questions. I assume that 
this shows that the cultural, social, and psy- 
chological controls which inhibit such oppo- 
sition are still fairly effective. Nor did I 
notice any social unrest because of this 
inhibition. Presumably this is because the 
situation has existed for so long that the 
responses to it have all become patterned. 

A more general explanation for the lack of 
the last two indicators of inadequacy men- 
tioned is the social mobility to be discussed 
later under social disorganization—those 
people who are most disposed to skepticism 
and social unrest leave the community. If 
those who are inclined to reject the mores 
emigrate, it is to be expected that the culture 
will remain relatively undisturbed. 

Other indicators of cultural inadequacy 
are increased nonconformity and the break- 
down of social controls. These are certainly 
in evidence. But in the face of the Me 
nomini’s increasing intersocialization and 
acculturation, and the resulting cultural dis- 
organization, I hesitate to assess the extent to 
which increased nonconformity and the 
breakdown of social controls may be laid to 
the inadequate expression of social opposition. 

In some degree the same difficulty arises in 
regard to social disorganization. But some 
aspects, namely social mobility and weakened 
group solidarity, are clearly affected by what 
may be called the consequences of inhibited 
social opposition. 

A number of people have left the com- 
munity in the past, and some are now con- 
templating such a move. They usually settle 
in the more acculturated village of Neopit, 
which is on the reservation, but occasionally 
they leave the reservation entirely. In the 
cases where the reasons are known to me, it 
is because they want to escape from the con- 
sequences of inadequacy which are being con- 
sidered: drinking and its results, gossip, 
jealousy, and factionalism. 

The members of the community are aware 
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of their weak group solidarity. The ritual 
preaching usually contains a reference to the 
fact that they do not really behave as mem- 
bers of one family should. The people often 
speak about the difficulty of achieving ade- 
quate collective action. 

Because of factionalism, the whole com- 
munity never assembles or reaches consensus. 
There are different and supposedly mutually 
exclusive religions in the community: mem- 
bers of the Powwow, Peyotists, and a few 
Catholics. The Catholics call the others 
“pagans,” which is resented as the deadliest 
of all insults. The Peyotists consider them- 
selves to be Christians; they look upon the 
Powwow people as backward, and upon the 
Catholics as Christians who do not practice 
what they preach. The Powwow members 
view the Peyotists as people who partake of a 
terrible drug, and the Catholics as contempt- 
ible people who have rejected the Indian way 
of life and adopted that of the white man. 
Each religious group has a body of unfavor- 
able rumors and jokes about the other. 

There are also subdivisions within each of 
the religious groups. The Powwow and 
Peyote groups have four or five leaders each, 
and there is ill-concealed rivalry between the 
leaders within each rite. The other members 
tend to give their allegiance to one or another 
of the leaders. Now, not only are a leader’s 
jealousies often adopted by his partisans, 
but the latter also contribute their own 
hostility toward other leaders and their fol- 
lowers. Then too, some are not leaders but 
think they should be, and consequently envy 
those who are. In the Powwow there is a 
split between the elder, more conservative 
people, and the younger folk who consider 
themselves to be more progressive, i.e., better 
able to cope with current conditions of inter- 
socialization and acculturation. The few 
Catholics are divided into those who have 
non-Catholic close kin and therefore feel 
more sympathetic with the Powwow or 
Peyotism, as the case may be, and those whose 
close kin are exclusively Catholic. The lat- 
ter’s strongest ties are with other Catholics on 
the reservation, but outside the community. 

The solidarity within the family of orienta- 
tion is not too strong. This is most obvious 
in the conflict between parents who conform 
to the relatively traditional mores, and chil- 


dren who adopt more of the white man’s 
culture. I am not sure whether tl..s conflict 
is due to social opposition on other grounds, 
which is displaced to the subject of mores, or 
whether it is simply a product of accultura- 
tion. When we turn to the relationships 
between offspring, there are the usual overt 
manifestations of sibling rivalry in childhood. 
Among adults this rivalry is more subtle, but 
present nevertheless. For instance, it is sig- 
nificant that there is little quarreling and 
fighting between parents and adult children 
when drunk, but it often occurs between 
adult siblings. 

There is also weak solidarity in the family 
of procreation. This is shown in figures 
dealing with dissolution. Of the 55 adults in 
the community who either are or have been 
married, 19 have been divorced one to four 
times; 5 others, married once, were separated 
in the summer of 1950. Another point may 
also be indicative of weak solidarity. There 
is no emphasis on premarital chastity; in fact, 
since the reservation was established the 
white officials have bewailed the “immorality” 
of the Menomini. Yet married people are 
highly suspicious of adultery by their spouses. 
No married woman of any age is supposed 
to be alone with a man who is not her closest 
kin; if it is necessary to be with any other 
man, she takes a chaperon with her—her 
husband, an older child, or female friend. 
As Whiting (20, p. 76) has suggested for the 
Paiute, among the Menomini this may be 
interpreted as either directly indicative of 
weak solidarity between spouses, or the dis- 
placement of opposition from the family of 
orientation to the family of procreation. Evi- 
dence for this may be found in three facts. 
First, after an adulterous affair the wayward 
spouse is almost joyfully accepted when he 
or she returns home, so that the idea of 
exclusive sexual possession does not seem to 
be the basis for suspicion of adultery. Second, 
when spouses are drunk they usually quarrel 
or fight with each other; we have seen that 
this does not occur between parents and chil- 
dren. Third, in all cases where there is 
strong solidarity between spouses, they either 
no longer have living ‘parents and siblings, 
or have rejected them. 

There is a significant belief regarding both 
types of families. Witches have to “feed” 
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their medicine bundles by killing some mem- 
ber of their family annually. 

No close friendships exist, and the people 
acknowledge this fact. Irrespective of how 
friendly a couple of men or women might 
seem to be, they invariably volunteer in pri- 
vate the usual type of derogatory gossip about 
each other, and I either saw or heard of 
drunken quarrels or fights between them. 

From this analysis of the degree of soli- 
darity existing in the various groupings found 
in the community, let us turn to the way a 
collective activity is carried out. Such activi- 
ties are considered to require a leader, chosen 
by consensus. When a group project is 
undertaken, the first one who volunteers to 
be a leader is usually accepted. Those better 
qualified do not want to compete against the 
volunteer, while the rest are reluctant to say 
anything against him for fear of starting an 
argument. ‘Therefore, whatever leader is 


chosen, there are always some people who are 
disgruntled. Indeed, even the most qualified 
hesitate to accept leadership, since they know 
they will be the target of backbiting once they 
stand out from the group, and will be per- 
sonally blamed for all the difficulties and 


failures which may ensue. Accomplishments 
are attributed to the group as a whole, the 
leader being given no credit for any success. 

Once the collective activity is begun, there 
is little esprit de corps and low morale. There 
is suspicious speculation as to the motives 
of others, particularly the proposers of the 
activity, and skepticism as to the degree to 
which others will participate. Thus a person 
feels free to perceive individualistically the 
common situations confronting them, and 
prefers to sacrifice the common goal to his 
own ends. As a result, none of the members 
wholeheartedly participate in a collective 
activity, and success is moderate at best. After 
the activity is over, each member privately 
proceeds to blame others for whatever failure 
occurred. 

This lack of solidarity even affects the 
accepted political organization and status 
system. The political organization of the 
group is gerontocratic. The elders are sup- 
posed to have superior wisdom because of 
greater experience, and they control the most 
serious human sanction, witchcraft. But 
even the elders pay for their position. The 


middle-aged are eager to pounce upon gaps 
in the knowledge of any elder, and make 
sarcastic comments about his ignorance. Vio- 
lations of custom by elders are favorite items 
of gossip, for they, of all people, should know 
better. And when there is nothing else to 
criticize, one can always hint darkly that 
some old man or woman is probably a witch. 
The elders are jealous of each other. They 
are usually not blatant about it, but their 
attitudes are revealed in occasional remarks 
and actions. When under the influence of 
liquor, each of those who drink has boasted 
to me—in confidence, of course—of his or 
her superiority over the other elders. 

It is equally perilous to stand high in the 
status system. One man has prestige as the 
best hunter in the community; others sourly 
comment that they have sometimes seen him 
return without game; anyway, he drinks too 
much. A few men are excellent singers and 
dancers; I have been assured that they aren’t 
devout at all, but participate in the rites 
simply in order to show off. A few years ago 
one family built the best house in the com- 
munity; I was told that it was not surprising 
that the parents and all the children had died 
within two years, for such pretension is 
bound to make witches jealous. 

We now come to the last indicator of cul- 
tural inadequacy, personality disorganization. 
The proportion of such disorganization is 
high. There are three cases of institutional- 
ized psychotics, and, significantly enough, all 
were diagnosed as paranoid schizophrenes. 
Two other cases of the same type are known 
to me. From conversations, I gathered that 
at least a half dozen others have had psychotic 
episodes in the past and, with one exception,’° 
these also seem to have been cases of paranoid 
schizophrenia. Neurotics seem to be com- 
mon; mere observation made me suspect that 
there are at least two dozens. I tried to make 
case studies of the psychotics and neurotics, 
but there is a traditional reluctance to discuss 
one’s private affairs with anyone (which is 
itself a symptom), and I could not find any 
way of motivating most people to violate this 
custom."* Therefore there is a serious gap in 

10 This is a case of reactive depression. 

11] could obtain an appreciable amount of material 
on only one psychotic; this case is inconclusive. (SLoTKIN, 


J. S.A case of paranoid schizophrenia among the 
Menomini Indians. Unpublished manuscript.) I did 
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my data, for I do not know the specific etiol- 
ogy producing most Menomini neurotics and 
psychotics. 

I was able to obtain a few scattered bits of 
relevant data. There exists at least some 
fantasy involving social opposition: a few 
people mentioned day and night dreams in 
which they talked back to, or struck, people 
against whom they harbored resentment. 
Again, it is precisely those who impressed 
me as having the greatest degree of inhibited 
opposition who are most fearful of witches 
and evil spirits; this may be interpreted as 
projected hostility. If suicide is considered 
to be displaced hostility, it is suggestive to 
find that the Menomini have a term for it. 
However, only three cases of suicide have 
occurred on the reservation in the last fifty 
years, one of which was in Zoar. All that I 
could learn of the latter, which happened a 
few decades ago, was that the victim had a 
reputation of being “suspicious” and “queer.” 
There has been one case of murder. A Zoar 
woman who is notoriously paranoid killed 
her white lover during a drunken quarrel, 
while living off the reservation. In fact, 
behavior while drunk seems to provide some 
of the most important supporting data for 
my hypothesis. If it is assumed that alcohol 
releases inhibitions, it is significant that com- 
petition and conflict occur under its influence, 
as mentioned previously. 


CoNCcLUSIONS 


The Menomini mores inhibit social oppo- 
sition. This produces symptoms of cultural 
inadequacy. 

When this finding is compared with the 
little material available on other cultures, it 
seems reasonable to infer, tentatively, that 
inhibited drives tend to vary with different 
cultures. What is now needed is a series of 
detailed psychiatric case studies made on 
neurctics and psychotics in unacculturated 





Not use projective tests, because G. D. Spindler spent the 
summers of 1949 and 1950 administering them. It will 
be interesting to learn the extent to which his findings 
confirm my own, 


exotic societies. Only in this way can we 


obtain direct evidence by which to test the 
hypothesis. 


REFERENCES 


1. Barnouw, V. Acculturation and personality among 
the Wisconsin Chippewa. Memoirs of the Ameni- 
can Anthropological Association, 72. Menasha, 
Wis.: American Anthropological Association, 1950. 

. Eocan, Dororny. The general problem of Hopi 
adjustment. Amer. Anthropologist, 1943, 43, 
357-373- 

. Faris, R. E. L., & Dunnam, H. W. Mental dis- 
orders in urban areas. Chicago: Univer. of 
Chicago Press, 1939. 

. Freup, S. Studies in hysteria. (Trans. by A. A. 
Brill.) New York: Nervous and Mental Disease 
Publ., 1936. 

. Freup, S. The ego and the id. (Trans. by J. 
Rieviere.) London: Hogarth, 1927. 

. Freup, S. The problem of anxicty. (Trans. by 
H. A. Bunker.) New York: Norton, 1936. 

. Hattowext, A. I. Psychic stresses and culture pat- 
terns. Amer. ]. Psychiat., 1936, 92, 1291-1310. 

. Harrower, A. I. Fear and anxiety as cultural and 
individual variables in a primitive society. ]. soc. 
Psychol., 1938, 9, 25-47. 

. Hattowert, A. I. Sin, sex and sickness in 
Saultcaux belief. Brit. J. med. Psychol., 1939, 
18, 191-197. 

. Harrower, A. I. Aggression in Saulteaux society. 
Psychiatry, 1940, 3, 395-407. 

. Hartowext, A. I. The social function of anxiety 
in a primitive society. American sociol. Rev., 
1941, 6, 869-81. 

. Harrower, A. I. Some psychological characteristics 
of the Northeastern Indians. In F. Johnson 
(Ed.), Man in northeastern North America. 
Papers of the Robert S. Peabody Foundation for 
Archaeology, No. 3. Andover, Mass.: Phillips 
Academy, 1946. Pp. 195-225. 

. Hormperc, A. R. Nomads of the Long Bow. 
Publications of the Institute of Social Anthro- 
pology, Smithsonian Institution, No. 10. Wash- 
ington: Government Printing Office, 1950. 

- Horton, D. The functions of alcohol in primitive 
societies. Quart. J]. Stud. Alcohol, 1943, 4, 
199-320. 

. Krucxonn, C. Navaho witchcraft. Papers of the 
Peabody Museum, No. 22.2. Cambridge, Mass.: 
Peabody Museurn, 1944. 

. Lanves, Rutw. The personality of the Ojibwa. 
Charact. & Pers., 1937, 6, 51-60. 

. Lanves, RutH. The abnormal among the Ojibwa 
Indians. J. abnorm. soc. Psychol., 1938, 33, 
14-33. 

. Linton, R. Acculturation in seven American Indian 
tribes. New York: Appleton-Century, 1940. 

. Storxin, J. S. Social anthropology. New York: 
Macmillan, 1950. 

. Wuirinc, Beatrice B. Paiute sorcery. 
Fund Publications in Anthropology, 
New York: Viking Fund, 1950. 


Received February 13, 1952. 


Viking 
No. 15. 





Tre Journat oF ABNORMAL AND SociAL PsycHcLocr 
Vol. 48, No. 1, 1953 


SOME DETERMINANTS OF APPARENT SIZE* 
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EVERAL years ago, Bruner and Good- 
S man (1) reported an experiment indi- 
cating that the apparent size of objects 

was accentuated when such objects were of 
value to the ten-year-old children who were 
subjects (Ss) in their experiment. Generally, 
the greater the value of coins, the more was 
their size overestimated. The effect was 
found to be more marked for poor children 
than for those from well-to-do homes. Over- 
estimation as a function of value and need 
was found to be present both when coins 
were present as stimulus objects and when 
their sizes were judged from memory. No 
comparably significant tendency was evident 
when coin-sized paper discs were presented. 
Carter and Schooler (2) repeated this ex- 
periment with several significant variations 
and their findings differed from Bruner and 
Goodman's in several ways. They found that 
there was a tendency for both larger discs 
and larger coins to be overestimated relative 
to smaller discs and coins. Inspection of their 
data shows, however, that the range of “dis- 
tortion” is considerably greater for coins than 
for either the cardboard or aluminum discs 
used by them. Thus, judgments with coins 
present showed an average distortion ranging 
from 5 per cent underestimation for the 
penny to 10 per cent overestimation for the 
half-dollar. There is a difference here of 
15 per cent (diameter) in apparent size be- 
tween the coin of lowest value and that of 
highest value used in the Carter and Schooler 
experiments. For aluminum discs, the cor- 
responding figures are 1 per cent underesti- 
mation for the penny size and 5 per cent 
overestimation for the half-dollar size—6 per- 
centage points. For cardboard discs, the 
separation is 4 per cent underestimation for 
penny size and 4 per cent overestimation for 
half-dollar size, a separation of 8 percentage 
1 This research was financed by a grant from the 
Laboratory of Social Relations at Harvard. The work 
reported here was carried out over a two-year period in 
the course of which we have benefited from discussions 


with Drs. Leo Postman, David McClelland, Robert A. 
Harris, and Frederick Mosteller. 
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points. Thus, while the tendency toward 
such distortion is present in both coins and 
discs, it appears to be more marked for the 
former than for the latter. We shall have 
occasion to return to the question of distor- 
tion range in the analysis of the experiments 
to be reported below. 

As for the difference between “rich” and 
“poor,” admittedly a poor operational defi- 
nition of need for coins, Carter and Schooler 
found that there was a statistically significant 
tendency for the “poor” to distort more than 
the “rich” only when judgments are made 
from memory. When the coins are present 
to view during judging, the differences be- 
tween the groups are not statistically reliable, 
though for all values of coins, they are in the 
expected direction, the “poor” overestimating 
coin sizes more than the “rich.” Such a 
tendency is not found for the discs. 

The present experiments seek to explore 
some of the factors which might conceivably 
have produced divergencies in the findings of 
these two pairs of experimenters. As Carter 
and Schooler point out, there are many things 
that may enter into size judgment, a fact 
attested to by the literature on size constancy 
(e.g., Gibson [4]). We felt that a fruitful way 
to proceed would be to study specifically those 
elements in the experimental procedure of the 
two investigations which were unlike in order 
to determine whether in the respective pro- 
cedures there might lie a basis for the differ- 
ence in results. 

Some of the more conspicuous differences 
in procedure can be summarized as follows: 

a. Shape of variable light patch set by Ss 
to match the sizes of coins and discs. Carter 
and Schooler used a circular patch of light, 
Bruner and Goodman a nine-chorded one 
cast by a conventional iris diaphragm. 

5. Placement of coins and discs, Carter 
and Schooler laid these on the table next to 
the judging apparatus; Bruner and Goodman 
permitted their Ss to hold them in their hand, 
resting hand on table next to the apparatus. 

c. Possible attitudinal differences. We 
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thought that, possibly, one set of Ss might 
have adopted a strong accuracy set, another 
a more evaluative or “value” oricnted set. 
There was no particular reason to believe 
this to be the case, but we felt it might be 
worth investigation. 

The above-mentioned factors, then, were 
taken into consideration in the design of the 
experiment along with certain constant fac- 
tors discussed below. No effort has been 
made to explore systematically the difference 
between the rich and the poor. As a psycho- 
logical variable, economic status by itself 
leaves much to be desired. Other studies by 
Rosenthal (8), Lambert, Solomon, and Wat- 
son (6), Klein, Schlesinger, and Meister (5), 
and McClelland and Atkinson (7) do a far 


Attitude preconditioning. Half of the chil- 
dren in each group were, by a procedure 
noted below, set to think of the buying power 
of money (the Value Group) and the other 
half were set to think about accuracy of size 
judgments of objects in general (the Accu- 
racy Group). 

Variable light patch. One-third of the Ss 
used a variable six-chorded light figure for 
matching the sizes of coins or discs (Six- 
Chord Group), another used a variable nine- 
chorded light figure (Nine-Chord Group), 
and a last third used a circular light figure 
(Circle Group). 

General randomization. Order of presen- 
tation of the different sizes of coins was ran- 
domized within each subgroup, as was the 
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better job of scrutinizing the need factor in 
judgments of apparent size than do either 
Bruner and Goodman or Carter and Schooler. 
Rather, we shall content ourselves here with 
the study of how the economic, social, or sym- 
bolic value of coins and discs affects their 
apparent size in a group of children “of 
middle income”--whatever that may mean 
psychologically. 


EXPERIMENTAL DesIcGNn 


The following variables were manipulated 
systematically in our experimental design: 

Value of coin. One group of Ss was pre- 
sented a penay, a nickel, and a quarter to 
judge; another judged correspondingly sized 
nickel-colored white metal discs; a third 
judged gray cardboard discs. We have then 
a first subdivision of Ss into Coin, Metal, and 
Paper Groups. There were 60 Ss in the Coin 
Group and 30 in each of the Disc Groups. 
The children were, of course, assigned at 
random to the three groups. 


size of the variable light figure from which a 
match began. 

Placement of disc and coin. Half the Coin 
Group held the coin in their hand (as in the 
Bruner-Goodman procedure); the other half 
had it placed on the table (as in the Carter- 
Schooler experiment). For certain purposes 
we shall treat both halves of the Coin Group 
together. For others, we shall separate them, 
for there were some important differences 
between them. 

The design, comprising 120 Ss, is presented 
in Table 1. 

An additional 30 Ss were run in a special 
experiment, judging circles of light. This 
group will be described later. 


METHOD 
Subjects 


Our Ss were drawn from the fifth and sixth 
grades of two public grammar schools in one of 
Boston’s middle-income suburbs. The great ma- 
jority were ten years of age, the remainder were 
eleven. The mean age of each of our various sub- 





Some DererMINANTs OF APPARENT SIZE 19 


groupings was approximately alike. The teacher 
introduced the experimenter to a class and assured 
them that the tests he would give would have noth- 
ing to do with their school work and that she would 
not know the results. Children were taken singly 
to another room, an empty classroom, for testing. 
These rooms were in shaded daylight illumination. 


Procedure 


The child was seated at a table before the appa- 
ratus, described below, and an effort was made to 
put him at ease by assuring him that this was really 
not a test at all since there were really no right or 
wrong answers, “but just judgments.” If a child 
was to be assigned to the Value Group, he was 
given some close variant of the following treat- 
ment—the variation depending, of course, upon the 
rapport attained: “Before we start, let me ask you 
a few questions about what you can do with money 
and what you do with yours. Suppose I were to 
give you a penny now and we were down at the 
store. What would you buy with it? How about 
a nickel? A quarter?” On the average, five to ten 
minutes of conversation were given over to dis- 
cussing desirable things—mostly, it turned out, 
candy—to be purchased for a penny, nickel, or 
quarter. 

Those Ss assigned to the Accuracy Group were 
given quite different treatment. Some variant of 
the following instruction was given: “Before we 
start, let me ask you a few questions. I want to 
see how good you are at judging the size of things. 
How about this ring (pointing to a rectangular 
signet ring), for example.” The children judged its 
length and width in fractions of an inch. Then 
they were asked to give the length and width of a 
3 by 5-in. card held before them, and finally, the 
dimensions of a large keyhole in the door. 

All Ss were then told, “Now let's get down to 
work” and were shown how to turn the knob on 
the light apparatus to vary the size of the light 
patch over its full range. Then they were told 
that their task was to match the size of the light 
patch on the apparatus to the size of various objects 
they would be shown. 

Twelve judgments were made by each S, four of 
each of the sizes of coin or disc. 


Apparatus 


The discs or coins were placed singly on the 
table, about twelve inches from the center of the 
light patch which served as the variable stimulus. 
The child faced the apparatus with the coin or disc 


on his left. The eye of the S was from 17 in. to 
19 in. from the rectangular front of the apparatus 
which was slightly below eye level. The front of 
the apparatus, 9 in. square, consisted of a 5 by 5-in. 
ground-glass window on the center of which was 
projected the light patch. The light source con- 
sisted of a 10-w. concentrated-arc point-source lamp 
(Western Union) whose light source of .4-mm. 
diameter limned a sharp image of the six-chorded, 
nine-chorded, or circular figures used. These latter 
were obtained by cutting appropriately shaped 
apertures in copper screens and mounting these as 


needed on a rack and pinion between the light 
source and the ground glass. The distance of the 
aperture screen from the light source (determining 
the size of the cast image) was controlled by a knob 
on the lower left of the front of the apparatus. The 
size of the light patch was variable from a diameter 
of 4.5 cm. to 1.2 cm. The brightness of the patch 
was 5.7 apparent foot-candles. 


RESULTS 


We do well to examine anew the hypothesis 
we are testing. The most general hypothesis 
is that the value of objects will somehow pro- 
duce a constant error in the judgment of 
their size. The constant error may manifest 
itself in one or both of two ways. First, it 
may lead to a difference in absolute apparent 
size. That is to say, a quarter may be per- 
ceived as larger than a disc of the same 
physical dimensions, and so, too, for pennies, 
nickels, etc., and their correspondingly sized 
discs. It is this measure that was central in 
the earlier studies of Bruner and Goodman 
and Carter and Schooler, and it seems logical 
enough. 

A second measure is relative in basis. Let 
us state it this way: the increase in overesti- 
mation going from a penny to a quarter 
should be greater for coins than for cor- 
respondingly sized discs. This should be so 
since the increase in value is greater in this 
size range for coins than for discs. A quarter 
is worth considerably more than a penny; a 
quarter-sized metal disc or cardboard disc is 
worth little more than a penny-sized one. 
This important measure was overlooked by 
both Bruner and Goodman and Carter and 
Schooler. 

In addition to the main hypothesis, stated 


TABLE 2 


Per Cent OVERESTIMATION oF Corns, Metat Discs, 
AND Paper Discs (Expressep as Per Cent 
DEVIATION FROM PurysicaL DiAMETER) 
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1.47 
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7-14 
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* Based only on those Ss who judged the coins in 
a position on the table. Those who held the coins 
in their hand are omitted. This renders the three 
groups comparable. The values for the “coin-in-hand” 
group were 1.02, 6.99, and 15.17 for penny, nickel, 
and quarter respectively, statistically indistinguishable 
from the “coin-on-table” group. 
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in terms of the two measures above, there are 
several others. It was our prediction that a 
value-oriented set would increase the accentu- 
ative effect of the value of objects. We also 
predicted that a more “ambiguously” shaped 
light patch would yield more judgmental 
accentuation than a circular light patch, 
although, we admit, it is hardly clear why 
this should be the case save that it follows 
some sort of loose axiom to the effect that any 
kind of ambiguity helps along. 

First, as to results in terms of grand 
means—the summary effects of object values, 
pooling all subconditions—we find in Table 2 
a statement of the amount of overestimation 
for the various objects. As in the Bruner- 
Goodman study, there is a significant differ- 
ence between coins and cardboard discs. The 
t tests between coins and correspondingly 
sized cardboard discs reach confidence levels 
of .o5 for nickel and quarter and .10 for 
penny (Table 3). But it is quite apparent 
that in terms of absolute level of accentuation, 
our results are negative where a comparison 
of coins and metal discs is concerned. To be 
sure, an analysis of variance, which we shall 
discuss more freely below, shows that type of 
object being judged is a significant source of 
variance, but this result is a function of some- 
thing other than any consistency in overesti- 
mation of the coins relative to the metal discs. 
It probably reflects the greater apparent 
magnitude of the two metals as compared 
with the paper. 

But a more meaningful pattern emerges 
when one examines not absolute level of 
accentuation, but relative level. In reviewing 
Carter and Schooler’s findings, we remarked 


TABLE 3 
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14.51 8.54 
N 30 30 30 


5.01 5.09 
1.61 
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* Based only on the “coin-on-table” group. Corres- 
ponding figures for the “coin-in-hand” group are 
indistinguishable from this group, being 5.97, 8.18, and 
14.15 for PN, NQ, and PQ, respectively. 


on the fact that the range of overestimation of 
coins—going from a penny to a half-dollar— 
was greater than the range of either metal or 
paper discs. The same holds for our data. 
In order to check the matter statistically, the 
following procedure was adopted. For each 
S we computed the magnitude of change in 
overestimation as he went from a penny to a 
nickel (PN), from a nickel to a quarter 
(NQ), and from a penny to a quarter (PQ). 
That is to say, if a given $ showed 1.5 per cent 
overestimation for the penny and 14.6 per 
cent overestimation for the quarter, his PQ 
shift was scored as the difference between the 
two, 13.1 per cent. Ss in the three groups 
were then compared (Table 4). 

First, a comparison of the 30 children in the 
Paper and the 30 in the Metal Groups. There 
appears to be no reliable difference between 
the two groups. A ¢ test between the mean 
PQ magnitudes for the two groups yields 
insignificant results (Table 5). 
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Note: A one-sided test was employed. 
® pS=.01. 
** pS.001. 
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Quite a different picture emerges upon 
comparison of the Coin Group with either 
of the disc groups. In a comparison of the 
30 Ss judging coins and the 30 judging metal 
discs, the two arranged at random, we find 
in 24 out of the 30 comparisons that the 
former show a greater increase in accentua- 
tion from a penny to a quarter (a proportion 
significant at the .05 level according to the 
sign test of Dixon and Mood [3]). The PQ 
increase in accentuation is 5.97 percentage 
points greater for the Coin than for the Metal 
Group. A ¢ test of the PQ difference scores 
of the two groups yields a figure of 3.59, sig- 
nificant at less than the .o1 level of confidence 
(Table 5). 

Similarly, a comparison of the Ss judging 
coins and those judging paper discs shows 
that the increase in percentage overestimation 
going from a penny to a quarter is greater 
for Ss in the Coin Group in 25 out of 30 
cases (significant at the .o1 level). The dif- 
ference in increase of accentuation (PQ) 
between the two groups is 7.81 percentage 
points. A ¢ test of the difference yields a 
figure of 4.31, again beyond the .o1 level of 
confidence. 

In sum, then, we find that the relative value 
of an array of objects does have an accentu- 
ative effect on their apparent size, but that 
this effect appears to be not in terms of abso- 
lute apparent size, but rather in the apparent 
relative size of the objects in the array. Put 
concretely, the overestimation of a quarter in 
relation to the overestimation of a penny (or 
nickel) is significantly greater than the over- 
estimation of a quarter-sized disc in relation 
to the overestimation of a penny- (or nickel-) 
sized disc. 

In analyzing this difference further, it is of 
interest to note that the quarter plays a rather 
crucial role in determining our results. Thus, 
the proportional increase in overestimation 
between a penny and a nickel is not notably 
or significantly greater than it is for discs of 
penny and nickel size. Where the difference 
occurs is between the penny and nickel on 
the one hand and the quarter on the other. 
Tables 4 and 5 summarize the various differ- 
ences discussed in the preceding paragraphs. 

We may turn now to the effect of our 
attempt to induce value- and accuracy-oriented 
sets in our young Ss. The mean percentage 


TABLE 6 


Per Centr Over- AND UNDERESTIMATION OF COINS 
aNp Discs UNDER VALUE AND Accuracy Sets 
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Value Set 
Coins-on-table -03 .48 41 15 
Coins-in-hand .80 -40 .58 15 
Metal discs -40 -70 -45 15 
Paper discs .80 83 -23 15 
All objects * -19 -67 -70 45 

Accuracy Set 
Coins-on-table 
Coins-in-hand 
Metal discs 
Paper discs 
All objects ® 


3.30 : 18. 15 
2.83 6. 13. 15 
3-68 10. 14. 15 

—1.25 5. 7.83 15 
1.91 8. 13. 45 





* Excluding “coin-in-hand” group because of its non- 
comparable judging conditions. 


of absolute over- or underestimation for the 
various coins and discs is presented in Table 6. 
This table contains a summary of the in- 
creases in overestimation as a function of the 
increasing value of the objects being judged. 

Consider first the differences in absolute 
size contained in Table 6. We note first a 
general and somewhat surprising tendency of 
our Ss to overestimate both coins and discs 
more when operating under accuracy precon- 
ditioning. A test of the reliability of this 
trend is provided by a ¢ test of the mean over- 
estimation of all objects (coins and discs) 
judged under the two conditions of set (see 
Table 8). The results of such a test yield 
figures of 0.47 for penny sizes (not signifi- 
cant), and 2.08 and 2.42 respectively for nickel 
and quarter sizes (p<.05). 

As for relative increase in overestimation 
(Table 7) as a function of increasing value or 
size, a somewhat different picture emerges. 
Here accuracy set has the effect of producing 
reliably greater incrcases in overestimation as 
one goes from penny sizes to quarter sizes in 
the Disc Groups (Table 7). No such tendency 
is present in the Coin Group, where accuracy 
and value set appear to have no differentiat- 
ing effect whatever, at least on those coin Ss 
who had the coin placed on the table before 
them during judgment.” Reliability levels are 
presented in Table 9. 

2 The one exception to this was the group of Ss who 


held the coin-to-be-judged in their left hands, again 
12 inches from the light patch. 
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TABLE 7 


Retative Increase 1n Per Cent Overestimation oF Corns AND Discs uNpDER Two ConpiTions or Set 








ConpDITION Co1n-on-TAaBLe 


Cotn-1n-Hanb 


Aut Osyects * 


Merat Discs Paper Discs 





Value Set 
Penny to nickel (PN) 
Nickel to quarter (NQ) 9.18 
Penny to quarter (PQ) 17.38 

N 15 


8.20 


Accuracy Set 
Penny to nickel (PN) 
Nickel to quarter (NO) 9.40 
Penny to quarter (PQ) 14.90 
N 15 


5.50 4-75 
7-17 
11.92 


15 


4.48 

4-03 

8.51 
45 


4-63 

0.40 

5-03 
15 


6.25 

2.83 

9.08 
15 





* Excluding “coin-in-hand” group. 


Up to now, we have not considered the 30 
Ss who judged coins held in their hand, as in 
the original Bruner and Goodman experi- 
ment. In the data previously presented, as 
noted tabularly, the hand-table differences 
were negligible. Here, however, an interest- 
ing difference is found. Recall that accuracy 
set produces a bigger differential in over- 
estimation for discs than does value set, but 
that the two sets have no differentiating effect 
on the coin-on-table group. In the coin-in- 
hand group, however, value set produces a 
significantly larger differential than does 
accuracy set. And so we have, in summary, 
accuracy set bringing about reliably greater 
ranges of overestimation for discs, set making 
no difference in overestimation range for 
coins placed on the table, and value set pro- 


TABLE 8 


Reviasmitry or Dirrerences (t) Between VALvue 
aNnp Accuracy Sets in Per Cent Over- anp 
UNDERESTIMATION oF Corns AND Discs 





ConpDITION Penny t NicKket ¢ QuarTer t 





Value Set vs. 
Accuracy Set 
Coins-on-table 
Coins in hand 
Metal discs 
Paper discs 
All objects § 


—1.47 
—1.06 0.26 

0.59 
—2.19° 
—2.08°® 


—o.98 
0.26 


—0.47 





Note: A one-sided test was applied. 

+ Negative values indicate that there was greater 
overestimation under accuracy set. 

© p<.05. 

°° p<.01. 

§ Excluding “coin-in-hand.” 


ducing a reliably greater overestimation range 
for coins held in hand. 

The effect of the shape of variable light 
patch used by Ss in matching the sizes of 
objects concerns us next. We may conserve 
space and tabular presentation by stating that, 
in terms of both absolute levels of size judg- 
ments and relative amount of overestimation 
with increasing value, this factor produced no 
reliable differences. The only trend worth 
noting—it falls short of statistical reliability— 
was for the nine-chorded variable figure 
(similar to the one used by Bruner and 
Goodman) to yield the largest amount of 
overestimation of the coins, and the least 
overestimation of the discs. Table 10 presents 
the data relevant to this comparison, combin- 
ing the data for metal and paper discs. 

We may say a few words at this point 
about the interaction of the conditions we 


TABLE 9 


Reviasitity or Dirrerence (¢) Between Vavur 
AND Accuracy Sets In INCREASE OF OveR- 
ESTIMATION OF CoINs AND Discs 








ConDITION PN + 


NQt 





Value Set vs. 
Accuracy” Set 
Coins-on-table 
Coins-in-hand 
Metal discs 
Paper discs 


—0o.60 

0.75 
—0.79 
—1.23 


0.23 
2.10° 

—1.39 

—1.56 


—0.44 
2.24° 
—1.68° 

—3.16 





t Negative quantities indicate accuracy sect yields 
larger PN, NQ, or PQ scores than does value set. 
* p<..05. 
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TABLE 10 


Pex Cent OvERESTIMATION OF Coins HeLp 1n Hanp 
AND Discs WHEN DirFERENT SHAPES 
oF VariaB_e Licut Patcn ARE 
Usep For JUDGING 








MarTeE- 
RIALS 


PATCH 


SHAPE Penny NICKEL 


QUARTER 





Six-chord coins .80 
discs 1.52 
coins 2.01°* 
discs 48° 
coins -91 
discs 3.07 


15.25 
9-97 

17.06° 
8.51°* 

14.95 
9.58 


5-25 
7-29 
8.50° 
5.01°%* 
7-45 
7-990 


Nine-chord 


Circle 





* Largest coin estimate. 
** Smallest disc estimate. 


introduced into the experiment. As a first— 
and, finally, hopeless—attempt to approach 
the problem, a large-scale analysis of variance 
was carried out. Six sources of variance were 
built into the design: value or accuracy set, 
the materials of discs or coins, the sizes of 
the discs or coins, the shapes of the light 
patches, the order of judging the sizes, and 
whether judgments were ascending or de- 
scending from large or small extremes of the 
light patch. The data of the analysis were 
arc sine transformed percentage deviations 
from the physical sizes of the discs or coins, 
with total degrees of freedom equal to 1439 
(120 Ss, each performing 12 judgments). 

As is so often the case, such hyper-designs 
prove to be more sensitive than the investi- 
gator who must interpret them. With the 
exception of light-patch shape, all primary 
sources were significant. So too, many of 
the interactions—some of the principal ones 
of these being materials sets, materials 
shapes, setsshapes, materials sizes, and 
materials X sets X shapes. Since we have not 
found it possible to discover corresponding 
psychologically meaningful trends in the 
ordering of means for these subconditions, we 
can only rest with the conclusion that there 
is still plenty to be learned here by some sub- 
sequent investigator.’ 

Finally, we consider briefly some character- 
istics of the judging process used by our Ss. 
Thirty 10- and 11-year-old children, not in the 
original design, were given the task of match- 
ing the light patch on our apparatus with 


8 The tabular data of the analysis of variance is 
available on request. 


light patches projected on a similarly shaped 
surface twelve inches away. Three light- 
patch sizes were presented for matching: 
penny, nickel, and quarter sized. A third of 
the children used the six-chorded variable 
patch, another third the nine-chorded, and 
the last third the circular patch. Judging 
conditions were as described before, as was 
procedure, save for omission of the value and 
accuracy games. We wished to find out the 
trend in judgment tendency when the match 
was made as homogeneous as possible, leav- 
ing out the influence of objects such as discs 
or coins if possible. More specifically, we were 
interested in finding out whether the tend- 
ency toward increasing percentage overesti- 
mation with increasing size would be present 
if the thing whose size was being judged 
were not a tangible, palpable object. 

The results are contained in Table 11. 

It is quite apparent from these figures 
that the progressive overestimation tendency 
noted for discs and coins is not a feature of 
these judgments. Rather, as one goes from 
penny to nickel size, there is a rise in over- 
estimation for all conditions, and then a 
decline. This was not a regular tendency 
among our 30 Ss, but was accounted for by a 
few Ss in each subgroup who grossly over- 
estimated the nickel size. 


CONCLUSIONS 


In the main, we have been only partially 
successful in isolating conditions which affect 
the apparent size of differently valued ob- 
jects. The resolution of the differences be- 
tween Bruner and Goodman and Carter and 
Schooler is far from complete. 

Certain points seem relatively clear. Carter 
and Schooler are, of -course, correct in re- 
marking upon the large number of factors 
which influence the absolute apparent size of 
an object. In the present experiments, we 


TABLE 11 


Per Centr OVERESTIMATION oF A Licut Patcu 








Nicket- QUARTER- 
Size N 


Penny- 
Size 


VARIABLE 
Licnt Patcu 





Six-chord 

Nine-chord 

Circular 
Total 


11.40 8. 10 
11.78 : 10 
10.20 ‘ 10 
11.13 -18 30 
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find that the value of objects does not un- 
equivocally affect their judged absolute size. 
There are too many physical differences in 
the texture, illumination, etc., of the three 
sets of materials—coins, metal discs, and 
cardboard discs—to risk any statement about 
absolute size. Coins and metal slugs are 
judged significantly larger than cardboard 
discs. Were we surer of the control of physi- 
cal factors, we might hazard the guess that 
metal slugs may “remind” a ten-year-old suf- 
ficiently of real coins to behave somewhat like 
them. 

The crucial difference, not a psychologically 
surprising one, is that, as one increases the 
value (and size) of coins, the extent of 
overestimation increases significantly more 
markedly than is the case with metal discs or 
paper discs. This we refer to as relative 
accentuation, and it is present in the Bruner- 
Goodman results and appears to be present in 
those of Carter and Schooler. 

As for the comparison of value orientation 
with accuracy orientation, two trends should 
be noticed. There is a tendency for accuracy 
set to increase absolute overestimation of discs 
and coins. With respect to relative accentu- 
ation, an accuracy set increases it for value- 
less discs. For coins, on the other hand, a 
value set augments relative accentuation 
under certain conditions. Under others, set 
has no effect on coins. 

It is difficult to say anything definitive 
about the effect of the shape of variable light 
patch used. We have noted merely that the 
use of a nine-chorded patch tends to have a 
slight effect leading to the overestimation of 
coins and to the underestimation of discs, 
which may have contributed to the larger 
differences earlier obtained by Bruner and 
Goodman. 

In conclusion, it appears to us that what 
differences remain between the original find- 
ings of Bruner and Goedman and Carter and 
Schooler will have to be resolved not by 
repeating the conditions of their experiments, 
but by devising new types of experiments 
designed to get at basic processes in the areas 
of perception and judgment. It is difficult to 
be sure of one’s findings when one uses coins, 
metal discs, and cardboard plaques as stimuli; 
it is also difficult to assess the contribution 
either of flceting instructional sets or of a 


child’s family’s incorne. A far better ap- 
proach, for example, is the study by Lam- 
bert, Solomon, and Watson (6) where a 
money-value system is built up and extin- 
guished experimentally and where the physi- 
cal characteristics of stimuli are not under the 
basic control of the mint. 

Finally, on the basis of the present study 
and on such others as those mentioned above, 
it is evident that the value of objects affects 
their phenomenal appearance. That it should 
affect their apparent size in some instances, 
as with coins, is probably a function of a 
pairing in the environment between value 
and size. Value may well have the effect of 
influencing other phenomenal attributes still 
more. While the present article is in intent 
technical in the more restricted sense, we 
must note that the importance of phenomeno- 
logical transformations which reinforce social 
conditioning is central to social psychology. 
Next steps in research, we feel, should be less 
concerned with demonstration of effects and 
more preoccupied with the process underly- 
ing such effects. Because our main finding 
has been that the effect of value is to increase 
relative accentuation but not absolute accen- 
tuation, there is reason to believe that not 
only strictly perceptual but also important 
judgmental phenomena such as anchoring 
and assimilation are at work here. 
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RELATIONSHIPS BETWEEN RORSCHACH SCORING CATEGORIES 
AND THE TOTAL NUMBER OF RESPONSES? 


DONALD W. FISKE AND 


University of Chicago 


HE outstanding recognition accorded to 
the Rorschach technique has brought it 


to the attention of those research in- 
vestigators who seek to further the scientific 
understanding of personality. Rorschach 
himself noted its research value (15, p. 13). 
These experimenters, together with those in- 
terested in making the interpretation of 
Rorschach protocols more objective, have 
tried to convert the procedure into a stand- 
ardized psychometric instrument. Investi- 
gators have looked for critical diagnostic 
signs, with dubious success. Such laborious 
statistical methods as factor analysis have 
been used in the effort to increase our under- 
standing of the test (9, 19, 20, 21). Cron- 
bach (5) has provided an excellent guide to 
the many difficulties involved in the quanti- 
tative investigation of Rorschach protocols. 
The purpose of this paper is to examine 
one aspect of the adequacy of the Rorschach 
test as a psychometric instrument: to what 
extent are the frequencies of responses in the 


several scoring categories related to the total 


number of responses? It is hoped that the 
results of this study will be useful to investi- 
gators who might want to use the test for 
research purposes, and also to clinicians using 
the test in clinical practice. Unless the Ror- 
schach is to be used solely in a qualitative 
manner, it is essential that the properties of 
the various scores be studied empirically. 

Lest there be any misunderstanding, let us 
state explicitly that in this paper we are not 
primarily concerned with the validity prob- 
lem. A gradually accumulating body of 
evidence suggests that some Rorschach scores 
do measure consistent aspects of a person’s 
response tendencies (8, 14, 18). While defini- 
tive investigations are not available, there is 
strong reason to believe that many of Ror- 
schach’s insights point to dimensions of 
temperament and personality which when 
isolated and adequately measured will help 
to structure the complex phenomena studied 
in the embryonic science of personality. 

1We are indebted to the following colleagues for 
their critical comments on this manuscript: Drs. Samuel 
Beck, George Calden, Lee Cronbach, David Grant, E. 
Lowell Kelly, E. H. Porter, Jr., and Morris Stein. 

We are also indebted to the Social Science Research 


Committee of the University of Chicago for a grant 
facilitating this study. 


EMMETT E. BAUGHMAN 
University of Wisconsin 


Sources oF Data 


Outpatients. To investigate differences between 
Rorschach psychograms obtained by different exam- 
iners, Baughman (1) collected 633 protocols from 
the files of the Chicago VA Mental Hygiene Clinic. 
These included all cases examined by each of 15 
examiners. The number of records per examiner 
varied from 21 to 69, except for one examiner 
who had tested 126 cases. The median number 
of records was 30. The records, collected over a 
three-year period, were from male veterans whose 
psychiatric status ranged from mild neurosis to 
definite psychosis, with abnormal cortical function- 
ing present in some cases. The majority were, 
however, neurotics. 

These examiners followed the scoring system of 
Beck (2) rather closely, but added certain variables 
from the approach of Klopfer and Kelley (11). 
The scoring of the original examiner was used in 
all cases. Baughman’s study found that for each of 
22 scoring categories there was at least one examiner 
whose records had a distribution significantly dif- 
ferent from that for all 15 examiners combined. 
Hence the examiners were divided into two groups: 
Group A included 9 examiners, each of whom had 
a distribution of R (total number of responses) 
which did not deviate significantly from that for 
the total group. Group B included 6 examiners 
with deviant distributions on R. The examiners in 
Group A had significantly deviant distributions on 
from © to 6 scoring categories out of 22, whereas 
those in Group B deviated on from 7 to 12 cate- 
gories. The total number of records from each 
group was 259 and 374 respectively. Some analyses 
were made for each subgroup separately, as well as 
for the two groups together. For simplicity in ex- 
position, however, the results for Group B by itself 
are not presented. 

Normal group. Beck has reported his analysis of 
a group of 157 employees of a mail order house (3). 
He and his associates sought to obtain “a repre- 
sentative sample of the community.” The group 
contained 71 males and 86 females. The age range 
was 17 to 69 with a mean of 30.5. Four vocational 
groups and a wide range of educational achieve- 
ment were represented. Our analysis indicates that 
the two examiners who collected ali the records had 
similar distributions of R, even when broken down 
into the four vocational groups. Each tested the 
same proportion of each of these four groups.” 


ANALYsIS 


To facilitate tabulation, each case was 
placed into one of nine classes on the basis 
of R. For each class in each sample the 
median frequency of each Rorschach score 
was obtained. The median was selected as 
the measure of central tendency because most 

2 The psychograms were made available to us through 


the courtesy of Dr. Beck and Dr. Thetford. We are 
deeply indebted to them for their cooperation. 
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TABLE 1 
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* One case in the normal group had 9 responses. It was added to the 6 cases in the 2nd R group. 
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TABLE 1—Continued 
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** Data not available on many cases. 


*** Total N: Outpatient A group, 259; All outpatients, 633; Normal group, 157. 


Rorschach scoring categories have marked 
skewness in their distributions. The medians 
are presented in Table 1. 

While the frequency in some classes is 
unfortunately small, we felt that a more 
coarse grouping might obscure some trends. 
However, it must be recognized that the 
reliabilities of the medians vary from class to 
class because of differences in frequency. 

To provide a single index for the degree 
of relationship between each scoring category 
and R, contingency coefficients (C) were 
computed. These are presented in Table 2. 
In the analysis of each scoring category for 
each sample, the median was obtained first. 
Then each R class was divided at this median 
for the sample, yielding a 2a table of fre- 
quencies, where n is the number of R groups. 
The value of » was g for the outpatient popu- 
lation. (In these computations, the two 
outpatient groups were thrown together be- 
cause the curves for the medians of the two 
groups were fairly congruent.) For the nor- 
mal sample, it was necessary to combine 
adjacent classes to obtain acceptable theoreti- 
cal frequencies. The three lowest R classes 
were combined. Contingency coefficients 


were computed from the resulting 27 
tables, and also from 25 tables obtained 
by combining the three highest R classes 
which had small frequencies. The two sets 
of coefficients were almost identical, those 
based on the 27 tables tending to be a few 
points higher. Since these larger tables 
seemed to represent the data better, the 
coefficients obtained from them are the ones 
presented in Table 2. Churchill (4) indicates 
that the maximal value for a contingency 
coefficient derived from a 2 Xn table is .71. 

On the basis of an examination of the 
contingency table for each score, a minus 
sign was attached to certain contingency 
coefficients to indicate instances where the 
relationship to R was generally negative. 
Inspection suggests that most of the obtained 
relationships between R and the several scor- 
ing categories are not simple linear relation- 
ships. The irregularities are in part a 
function of the number of cases. As a rule, 
the relationships are more regular for the 
total outpatient sample than for Group A 
alone, or for the normal sample. These data 
indicate that in Rorschach studies, as in other 
psychometric research, the amount of random 
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TABLE 2 
Revationsuip Berween Eacn Scorinc CarTecory 
and R 


(Coefficients of contingency) 








PoPULATIONS 





Scorinc 


CATEGORY Normat Group 


ALL OUTPATIENTS 
(N=633) (N=157) 


W +22 (.13)t 
D -63 -63 
Dd -51 -46 
M 45 41 
Cc +25 +34 
CF +30 -28 
FC +39 (.22)t 
=c +44 +33 
YF +30 .28 
FY +47 -38 
FV +33 -45 
F+ -56 -57 
F— +43 +49 
FY% —.16 —.26 
F+% —.19 —.34 
P -41 +39 
Ss -41 -36 
H +43 49 
Hd +47 +40 
A +52 +53 
Ad -47 -40 
An +30 037 
Sex +25 e 
A% -29 

4 . 





—.31 
(.23)t 


FM -45 ° 
m -41 e 
8-10/R +25 ad 

> 


Geog. +30 
Median of 
above variables -41 -38 
T/iR —.31 ° 
T/R —.30 ° 


+ With the exception of these three, all values of C 
are significant at the .05 level. 

* Omitted values were not computed because scores 
were not available, or because frequencies were too 
small. 





fluctuation found in small samples is so great 
that any trends in the data must be inter- 
preted with extreme caution. 

Most of the coefficients show significant 
positive relationships. However, in interpret- 
ing these relationships, one must recognize 
that all of the coefficients are “spurious” be- 
cause they are part-whole correlations: the 
total number of responses includes the fre- 
quency in each scoring category, and some 
categories (e.g., D) make major contributions 
to R. Nevertheless, this condition presum- 
ably has no general effect upon the form of 
the relationship. 

As Fig. 1 indicates, the various scoring 
categories show diverse types of relationships 
to R.' However, certain variables appear to 

1 Graphical representations of all relationships referred 
to in this paper (similar to those presented in Fig. 1) 


have been filed with the American Documentation Insti- 
tute. To secure a copy of these graphs order Document 


show similar relationships with R. Thus the 
curves for D, F+- and A have comparable 
slopes, perhaps because each represents a 
general tendency to give commonly perceived 
responses. These three categories have the 
largest contingency coefficients for both popu- 
lations. The same trend may be present in 
another cluster which also has approximately 
linear, but flatter, curves: FC, Sum C, FV, 
F—, and P. (The curve for P might re- 
semble that for D if there were no limit to 
the number of possible P responses.) Still 
linear, but almost horizontal, slopes are found 
for C, Y, YF, and Sex, all of which occur very 
infrequently. 

Most of the other variables appear to have 
curvilinear trends. Dd has a hyperbolic, 
rapidly accelerating curve. If it is true that 
a subject usually gives only one response to 
an area, it is not surprising that as total R 
increases, the number and also the proportion 
of responses to unusual details must increase 
even more rapidly. 

The three percentage scores, F°,, F+-%, 
and A%, show relatively low negative rela- 
tionships with R. As R increases, proportion- 
ally more responses tend to use uncommon 
determinants and content. It is interesting 
to note that with higher R, the latency of the 
first response (T/1R) also drops off. 

The relationship patterns for most of the 
other variables seem to be complex: they 
usually have a levelling off or a dip in the 
curve. For example, the curves for the total 
outpatient group for M, H, Hd, and Ad (and 
possibly for FM, FV, and Geog) show a 
fairly even increase except for a flattening 
or a slight reversal in R classes 6 and 7 (30 
to 39 responses). While the corresponding 
curves for the normal group are irregular, 
they are not congruent with those for che 
outpatient group, or even with each other. 
(Parenthetically, we may observe here that 
for both the outpatient and the normal 
groups, the H and Hd curves resemble each 
other, whereas the A and Ad curves are not 
similar.) 

A similar reversal of trend is found in the 
outpatient group’s curves for m, CF, Sum C, 





3701 from American Documentation Institute, 1719 N 
Street N.W., Washington 6, D.C., remitting $1.00 for 
microfilm (images 1 in. high on standard 35-mm. 
motion picture film) or $4.80 for photocopies (68 in.) 
readable without optical aid. A limited number of 
mimeographed sets of graphs are also available. To 
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sponses) is definitely below those for the two mal group does not repeat this effect. 
adjoining classes. (The Dd curve also has § How should we evaluate these irregulari- 
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ties? The magnitudes of the changes are 
not great, and the trends may be a conse- 
quence of the characteristics of the particular 
samples which happen to fall into certain R 
classes. We have not attempted to determine 
the statistical significance of these differences 
because it is meaningless to test a few excep- 
tional discrepancies out of many possible 
comparisons in these empirical data. Fur- 
thermore, we see no meaningful interpreta- 
tions of these specific trends, other than the 
broad generalization that individuals who 
differ in productivity (R) also differ in their 
thresholds for some determinants, locations, 
and content categories. We shall develop 
this point later. 

The two W curves show a remarkable con- 
gruence in pattern which is not repeated for 
any other variable. 

For the majority of the variables, the out- 
patient and the normal groups show fairly 
similar relationships with R: there are no 
sharp discrepancies. With the exception of 
FY, the general slopes are quite consistent. 
The median contingency coefficients are 
almost the same. The differences between 
the two sets of oT cients are probably not 
due to systematic differences between the 
groups. 

In the normal group, two of the lowest 
coefficients (both below the .05 level of con- 
fidence) are for W and Z. The value for W 
is also low in the outpatient group. Thus 
the capacity to organize or integrate a com- 
plex stimulus into a single percept is largely 
independent of productivity. 

There are differences between the two 
groups in the proportions of responses in 
various categories: the normal group had 
relatively lower scores on W, CF, YF, FY, 
and S$; but higher scores on F+-, F°4, F+-%, 
and H. While we did not tabulate the 
M : Sum C ratio, we can compare the two 
sets of medians for each group: for the patient 
group, Sum C is higher for almost every R 
class; for the normal group, M is higher 
except where R is below 20. We do not 
believe that safe inferences can be drawn 
concerning these differences since the groups 
differ in proportion of women and were not 
matched for age, intelligence, education, or 
socioeconomic status. 


Discussion 


The results presented above seemed to be 
in general agreement with the data published 
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by Wittenborn (20, 21) and Hughes (9). In 
each of those studies, factor analysis yielded 
what appears to be a productivity factor. We 
cannot, however, readily compare our con- 
tingency coefficients with the corresponding 
Pearson r’s (Wittenborn) or tetrachoric r’s 
(Hughes). Different scoring systems and 
different scoring categories were used. More- 
over, C and r are not directly comparable. 

Our data seem to differ from Wittenborn’s 
in another respect. He found no evidence of 
nonlinear relationships with R (20, p. 263) 
whereas many of the relationships in our data 
appear to be nonlinear. 

Scores based on the response frequency in 
a particular Rorschach scoring category are 
unsatisfactory psychological measures (cf. 5). 
While it is clear that frequencies in most cate- 
gories are related to the total number of re- 
sponses, we cannot state, on the basis of the 
present data, which influences the other. We 
believe that a subject may take a set toward 
a certain productivity level and then seek to 
find the necessary responses to reach it. On 
the other hand, it is true that subjects with 
predispositions to perceiving H and Hd re- 
sponses in Dd areas would tend to have com- 
paratively high R’s. 

Most of the category scores cannot be im- 
proved by handling them as percentages of 
the total number of responses (cf. Beck, et 
al. [3, p. 253]). While much of the relation- 
ship is removed by the process, there is still 
a relationship between the percentage meas- 
ures and R. In the cases of W, P, and A, 
percentage scores would he comparable if 
computed by omitting the first three re- 
sponses in that category, i.e. the formula 
(A—3)/R yields a score unrelated to R. 
Since the great majority of subjects give at 
least three responses in each of these cate- 
gories, individual variability related to R 
appears most clearly in the number of re- 
sponses beyond that point. Furthermore, the 
use of percentage scores cannot remove all the 
effect of R: the variability of these scores may 
itself be related to R. 

With respect to the relationship between 
P and R, Thompson (16) reports that for 
group Rorschachs administered to college 
students, the curve for P flattens out after 
forty responses. This tendency was not 
marked in either of our groups. 

For the clinical application of the Ror- 
schach, it would be possible to prepare norm 
tables for each R class (cf. Fonda’s table for 
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S[7, p. 374]). However, these would have 
to be based on sizable frequencies in each 
class to yield stable norms. Furthermore, 
separate norms should probably be prepared 
for various groups classified by intelligence, 
education, psychiatric status, etc. While such 
material would enable the clinician to be 
more objective in his evaluation of protocols, 
considerable labor would be involved in pre- 
paring the norms. Moreover, norms would 
be useless as long as examiner differences in 
scoring are present (cf. 1). 

The numerous irregularities in the rela- 
tionship curves are probably due in part to 
examiner differences and to examiner-subject 
interactions (cf. 12). A Rorschach response 
is determined by the subject’s perception of 
the total situation, including the instructions, 
the card, his previous responses, the purpose 
of the testing situation, and the examiner, as 
well as by his usual modes of reaction. In 
research studies, it is imperative that no 
examiners be used whose response distribu- 
tions differ appreciably from some standard, 
and whose method of administration is not 
highly consistent from day to day. On the 
other hand, for clinical purposes the Ror- 
schach can, of course, be used as a structured 


interview in any way the clinician sees fit. 
One may question the value of using the 
Rorschach technique in research studies to 


measure particular variables. The funda- 
mental insights of Hermann Rorschach can 
probably be incorporated into much more 
satisfactory instruments. For example, at the 
University of Chicago, L. L. Thurstone and 
Eckhard H. Hess are developing efficient tests 
for assessing color-form preferences. A series 
of such tests should yield a profile of response 
tendencies with more scientific value than the 
present Rorschach psychogram (cf. Zubin’s 
suggestions [22]). It may even be possible 
to create a technique for the simultaneous 
measurements of the principal dimensions of 
personality, thus preserving interactions be- 
tween dimensions and making the total pro- 
cedure less artificial. 

Productivity or reactivity should be meas- 
ured in a test or series of tests developed for 
that specific purpose. Such a test would 
require carefully worked out instructions. It 
is our tentative hypothesis that each subject 
perceives the Rorschach test situation as call- 
ing for some given degree of application to 
the task of finding percepts in the rather 
unstructured card. Energy output and rate 


of expenditure may be relevant concepts here: 
recall that the interval before the first re- 
sponse is negatively related to R. A similar 
relationship is found for time per response, 
T/R, (which probably measures the interval 
between responses) for a portion of the 
outpatient group on whom this value was 
computed. 

Given a set to expend a certain amount of 
energy on the task, certain perceptions are 
easier than others. Thus subjects with a high 
energy-output set give proportionally fewer 
W and P responses because these rapidly 
become more difficult after the first few; on 
the other hand, Dd responses are compara- 
tively easy for some subjects and may there- 
fore become proportionately more frequent. 
Accuracy of perceptions (F+-°%) diminishes 
only slightly with increasing productivity, 
and F— increases at a nearly constant rate. 
While M and Hd become proportionately 
more frequent, the other content and deter- 
minant categories make a fairly constant con- 
tribution to the total R. 

Can the relationships between scoring cate- 
gories and productivity be understood in 
terms of Maslow’s concepts of the expressive 
and the coping components of behavior (13)? 
Perhaps productivity is primarily coping or 
adaptive behavior. The subject meets the 
requirements of the test situation (as he per- 
ceives them) by giving a certain number of 
responses. On the other hand, many of the 
other scoring categories reflect the expressive 
component: the content and the determinants 
of a response may not contribute significantly 
to the subject’s adjustment to the immediate 
test situation but may essentially be products 
of the state of the organism. We would not 
deny, of course, that a particular response 
may be adaptive insofar as it reduces tension, 
or that the total number of responses may not 
be in part expressive of inner personality 
trends. We are suggesting that the major 
sources of variability may be usefully related 
to this dichotomy. We agree with the view 
of Klein and Schlesinger (10) that we should 
recognize the adaptive properties of percep- 
tual acts, but we would distinguish between 
the aspect of perception contributing pri- 
marily to adaptation to the external environ- 
ment and that aspect contributed by internal 
processes which are relatively independent of 
the specific external situation. 

What are the solutions to the problems 
raised by these data? We have suggested 
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above that the Rorschach test be replaced by 
a series of tests measuring the same variables, 
possessing a more firm psychometric basis, 
and retaining some provision for recording 
interactions between them. Such a radical 
plan is of no present value—it will take years 
to develop such tests. In the meantime, it 
may be profitable to explore ways of con- 
trolling R. Werner (17, p. 60) tried one 
method: since his groups had different mean 
R’s, he used only the first three responses to 
each card in computing Dd. Eichler (6) 
has employed the analysis of covariance to 
adjust for differences in R. Another possi- 
bility is to alter the instructions by giving a 
specific set for, say, three responses per card; 
the same result might be better achieved by 
stopping the subject after the third response 
and encouraging unproductive subjects until 
they give three responses. Such modifications 
would sacrifice some information obtained by 
the standard instructions. 

While the primary purpose of this paper 
has been to contribute to our understanding 
of the Rorschach test, we hope that the tables 
will be useful to the clinician. He can com- 
pare the psychogram of a given protocol with 
the median psychogram for members of our 
normal or outpatient group who gave a com- 
parable number of responses. 


SUMMARY 


For a group of 633 outpatients at a mental 
hygiene clinic and for the 157 cases in 
Beck’s normal group, the median frequency 
in each Rorschach scoring category was ob- 
tained for classes based on total number of 
responses (R). Contingency coefficients be- 
tween R and each category were computed. 
The findings were that: 

1. The relationships between R and each 
scoring category often appear to be complex 
and nonlinear. 

2. The form of the relationship with R 
seems to vary for the various categories 
although some have similar patterns. 

3. The forms of the relationships with R 
are fairly similar for the normal and the out- 
patient groups. 

On the basis of these data, we agree with 
Cronbach (5) that scores based on fre- 
quencies of responses in particular scoring 
categories are unsatisfactory psychological 
measures and that taking these scores as per- 
centages of R is only a partially adequate 
solution to the problem. For research on the 


dimensions of personality, improved meas- 
ures must be developed for promising Ror- 
schach variables. 
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PROPAGANDA, DEBATE, AND IMPARTIAL PRESENTATION AS 
DETERMINEKS OF ATTITUDE CHANGE 


R. F. JARRETT AND ALEX C. SHERRIFFS 
University of California 


PERSON maintains or modifies his attitudes 
and beliefs at least in part as a cog- 
nitive response to direct arguments to 

which he may be exposed. Several factors 
influence the attitudinal response which the S 
may make to such arguments. We have 
found it helpful in our thinking to categorize 
these factors into three classes: properties of 
the context within which the arguments are 
presented (contextual variables), properties of 
the arguments themselves (stimulus vari- 
ables), and properties of the subject (personal 
variables). These latter (personal variables) 
inhere in S himself and in the relations 
between him and the contextual and stimulus 
variables. 

Among the contextual variables we may 
list as examples the formal or informal debate 
between competing systematic presentations 
favoring alternative positions along the atti- 
tude continuum (which may or may not be 
unidimensional), aggressive propaganda of a 
single individual pressing the virtues of some 
particular position along the continuum, and 
(as in the educational situation) arguments 
supporting alternative—but incompatible— 
positions set forth by a single author in a 
context of impartiality. 

Some of the properties of the propositional 
stimuli themselves which may influence the 
nature of the attitudinal response induced in 
S would seem to be the extremeness of the 
attitudinal position argued for, the extent to 
which the arguments appeal to reason or to 
emotion, the medium of presentation (oral, 
written, newspaper, television, etc.). 

Most of these variables have been recog- 
nized by published research, though evidence 
concerning the nature of the influence of 
many of them is equivocal. 

Among the personal variables which may 
be related to the effect of such propositional 
stimuli (and among these factors are to be 
found those of greatest interest to the present 
writers) would be included the momentary 
position of S along the attitudinal continuum 


relative to the position of the argument (or 
arguments) to which he is subjected, the de- 
gree of importance which this attitudinal area 
enjoys in S’s value and belief systems,’ the 
origin of S’s present position on the atti- 
tudinal continuum, etc. 

The authors are currently engaged in a 
series of investigations on the influence of the 
variables outlined above on attitude change, 
and the present article is a report on a study 
of the nature of the influence of some of the 
contextual variables in attitude change. Spe- 
cifically, we were concerned with the question 
of the nature of the effect of the following 
types of presentation of arguments intended 
to modify attitudes: (@) a direct argument in 
favor of one rather extreme attitudinal posi- 
tion over the “opposite” position; (4) a 
debate situation in which the two systematic 
presentations are submitted in a controversial 
atmosphere; and (c) a situation in which a 
single individual presents both of these 
opposed positions in impartial juxtaposition. 

Our hypotheses concerning the effects of 
these modes of presenting our arguments are 
based on the fundamental assumption that 
the momentary position of a person on any 
attitudinal continuum is the end product of 
a complex learning process. In this learning 
process, some internal factors may operate 
either to oppose or to facilitate modifications 
in certain directions, regardless of the nature 
of the external stimulating conditions. Thus 
for a person whose position with respect to a 
particular attitude is very important in 
enabling him to explain much of his be- 
havior, in obtaining the approval of his asso- 
ciates, etc., these internal forces will oppose 
modification of the attitude in any direction 
which would make his new attitudinal posi- 
tion less useful to him and facilitate any 

1Some writers (4) would say that where the “at- 
titudinal area” is very important in the S’s value system, 
we are dealing with a true “attitude,” while the S's 
position in an “attitudinal area” which is relatively unim- 
portant to the S is a case simply of a “belief.” We 


prefer to think of attitudes as differing along a continuum 
of involvement. 
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modification which would make it more use- 
ful to him. While we are not here primarily 
concerned with the mechanisms of these 
internal forces, we may point out that if such 
a person is subjected to arguments for a posi- 
tion quite different from his own, he may 
defend himself against modifying his position 
by such varied means as denying that the 
proponent of the other position is deserving 
of consideration, misinterpreting the infor- 
mation offered, etc. Our views of the process 
by which attitudes are modified would lead 
us to the following predictions: 

1. For the direct argument in favor of an 
extreme attitudinal position, we would expect 
that the internal factors would facilitate the 
modification of the attitude in the direction 
of the argument for some Ss and resist that 
change for others, but that the net effect over 
all Ss will be to move them in the direction 
of the argument. The literature abounds 
with studies showing that it is possible to 
modify the attitudes of groups of Ss if one 
presents only arguments for one definite 
position (5, Ch. 13). 

2. For the debate situation—equally strong 
opposed arguments in an atmosphere of con- 
troversy—we expect that the groups will 
change in the direction of that argument 
which is on the same side of a neutral posi- 
tion as was the group, i.e., they will become 
more extreme in the positions held before 
hearing the arguments. For uninvolved Ss, 
this condition may be expected to have mini- 
mal effect, since the climate of conflict and 
the extremity of the two positions will throw 
doubt on the motives and understanding of 
both parties to the debate. Probably the 
simplest response for Ss in this situation is to 
reject both arguments as unworthy of con- 
sideration. To the extent that Ss depend on 
their attitude positions for maintaining self- 
esteem, etc. the debate may be expected 
generally to strengthen or increase existing 
“biases,” because Ss will find it easier to 
accept those arguments on their own side of 
the neutral position. However, the shift 
should not be as great as in the case of direct 
arguments alone since the opposing argu- 
ment must be granted some effect. There 
has been a suggestion on the basis of group 
data that such an intensification of previously 
existing bias may occur (3). 


3. For the situation in which two sets of 
equally strong opposed arguments are pre- 
sented in a context of objectivity and reason- 
ableness, we would expect Ss to become more 
moderate in their position—i.e., to approach 
the neutrai position. Uninvolved Ss _ will 
moderate their attitude because the objec- 
tivity and impartiality of the argument will 
lead them to ascribe authority to the writer, 
and, having no involvement, they will move 
toward the position of moderation which he 
is seen to represent. For more involved Ss, 
a similar result is to be anticipated, though 
perhaps to a lesser degree. In the debate 
atmosphere these more involved Ss can reject 
many of the arguments perceived as unfavor- 
able by believing the author to be biased, 
uninformed, unintelligent, etc. In the rea- 
sonableness climate this is more difficult, since 
the same person presents the supporting as 
well as the opposing point of view. Further, 
for college students, in any event, the values 
attached to reasonableness itself may be ex- 
pected to lead Ss to desire some identifi- 
cation with this moderate position of the 
author. The writers know of no previous 
study of direct relevance to the reasonableness 
condition. 


METHOD 
Experimental Design 


The experimental design for the present study is 
very simple. It involves obtaining a preliminary 
assessment of each S's position with respect to a 
defined attitude, submitting Ss to the influence of 
arguments intended to modify their attitude, and 
reassessing the position of each S. There were five 
experimental groups, one (the control group) not 
exposed to arguments relevant to the attitude and 
four groups exposed to relevant arguments as fol- 
lows: direct argument in favor of one extreme posi- 
tion with respect to the attitude; direct argument 
in favor of the “opposite” extreme position with 
respect to the attitude; opposed arguments, one in 
favor of the one, the other of the other, extreme 
position; both arguments presented as by a single 
objective and unbiased author. 

For all experimental groups the “arguments” 
were presented in such a manner as to conceal the 
intent to modify the particular attitude under in- 
vestigation. The nature and effectiveness of this 
disguise are discussed below. 


Subjects 


The Ss were 815 students enrolled during either 
of two successive spring semesters in the course on 
personal and social adjustment open to nonpsychol- 
ogy majors at the University of California in 
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Berkeley. In the 1950 semester there were 391 par- 
ticipating students and in the 1951 semester 424. 
Of the 391 in the first semester, 179 were men and 
212 were women; of the 424 in the second semester, 
158 were men and 266 were women. These Ss 
represented 77 per cent of the total student enroll- 
ment in the course during these semesters? and 
some 42 major fields offered within the university. 


The Attitude Investigated 


The problem as to what attitude to use in such 
an investigation is a troublesome one. It would 
seem that such an attitude should meet several 
rather restrictive criteria: (a) it should be one with 
respect to which all college students could be ex- 
pected to have some stand or position; (4) it should 
be one on which college students may be expected 
to differ widely, perhaps from extremely “pro” to 
extremely “con”; (c) it should be possible to assess 
individuals with respect to it; (d) it must be one 
concerning which “reasonable” arguments might 
be adduced in support of any extreme position; 
(ce) it should be one with respect to which well- 
defined criterion groups are readily available within 
the college population; (f) it must be of such a 
nature as to make it possible to relieve Ss of 
any tension which the experimental situation may 
develop in them, and to be useful in the investiga- 
tion of some of the problems of interest to the 
present writers; (g) it should be one about which a 
fair proportion of college Ss might be expected to 
have more or less strong emotional involvement. 

The attitude selected as best satisfying these cri- 
teria was the complex one held by individuals 
toward the relative “value” of men and women in 
our culture. Such “value” is implicitly defined in 
the “scale” which we used in assessing each S's 
position with respect to it. 


The Assessing Instrument 


A cursory search of the literature revealed no 
existing device for the assessment of this attitude, 
but its promise as a research attitude tempted us to 
construct one. 

A large number of behaviors and attributes of 
people which generally have value judgments 
applied to them were gathered from a wide variety 
of sources. Each of these behaviors and attributes 
was judged by the investigators and several col- 
leagues as to whether it was positively or negatively 
valued by the general population, and as to whether 
it was generally included within the stereotypes for 
males or for females. 

Fifty-eight of these behaviors and attributes were 
selected for inclusion in the battery of items. Half 
of these were items judged to be positively valued 
in varying degrees, half to be negatively valued in 
varying degrees.* Seventeen of these were judged 


2 The only class members not included in the study 
were those who were absent on one of the two test days. 

® Value ratings of these items made later by the Ss 
themselves (but before they were made aware of the 
nature of the experiment) corroborated these prejudg- 
ments. 


appropriate to the cultural stereotype for males, 17 
appropriate to the cultural stereotype for females. 
The remaining 24 items were believed to be unre- 
lated to conventional stereotypes of masculinity or 
femininity. A complete list of these items is pre- 
sented in Table 1. 

Each S was required in a forced-choice situation 
to state with respect to each behavior in the battery 
whether it applied more appropriately to men or to 
women. He was required, for example, to indicate 
whether men or women were in general best char- 
acterized as understanding the real needs of chil- 
dren, showing greatest emotional balance in crises, 
most likely to be careless in matters of personal 
cleanliness, most likely to hold a grudge, etc. 

A total score may be obtained from such data in 
a number of ways. After considering several pos- 
sible scoring schemes, we decided to use the 
simplest one, which involves assigning -+-1 or —1 
to each response as follows: any positively valued 
behavior ascribed to men and any negatively valued 
behavior ascribed to women contributes +1; any 
negatively valued behavior ascribed to men or posi- 
tively valued behavior ascribed to women con- 
tributes —1. There were 58 items on the list of 
behaviors used; thus the possible total scores ranged 
from —58 for a person who has only bad to say 
of men and only good of women, to +58 for a 
person who has only good to say of men and bad 
of women. 

We wished to keep Ss ignorant of both our inten. 
tion to assess them with respect to this attitude and 
our attempt to influence their attitudes. To achieve 
this, both the purpose of the experiment as a whole 
and the functions of each of its parts were disguised. 
The disguise in the case of the value battery was 
effected by requiring Ss to indicate not only that 
each behavior best characterized either men or 
women, but also requiring them to make two other 
forced-choices of classes of people to which the 
behavior best applied.* 

That we were successful in this attempt at dis- 
guise is suggested by the fact that when given the 
opportunity to indicate (at the conclusion of the 
experiment) what they thought it was all about, 
not one of the 815 Ss showed in any way his 
insight into the nature of the study. 


The Experimental Arguments 


The five written “arguments” (written so as to 
have minimal overlap with the items of the value 
inventory®) were prepared by one of us as follows: 


1. A strong pro-male argument. This consisted 


*Thus the first group of Ss (the 1950 class) was 
required to indicate whether each behavior best charac- 
terized 20-30 year-olds or 45-55 year-olds and also 
whether it best characterized Catholics, Jews, or Protest- 
ants. Since this last forced-choice (religious categories) 
aroused anxieties and antagonism in a number of the 
Ss in this class, it was changed for the 1951 class to a 
choice between “only children” and “people with one or 
more siblings.” 

5 There is specific reference to values included in the 
inventory in only four instances—twice in the pro-male 
argument and twice in the pro-female argument. 
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TABLE 1 


Irems oF THE “VALUE |NVENTORY” 











. Proportional to their numbers, most responsible for 
auto accidents in 1939 were: 

. Having the most understanding of the real needs 
of children are: 

.Most likely to be careless in matters of personal 
cleanliness are: 

. Most subject to violent outbursts of temper are: 

. Predisposition to insanity is more frequent among: 
. More likely to hold a grudge are: 

. Showing most consideration for persons older than 
themselves are: 

. Best losers at card games are: 

. Most conscientious in preparing for going to the 
polls on election day are: 

. Least likely to get upset over small details are: 

. Best equipped for the medical profession, other things 
being equal are: 

. Life is endured more than enjoyed more often by: 

.More marital difficulties arise from the jealousy of 
persons who are: 

. Showing greatest warmth toward those less fortunate 
than themselves are: 

. More courageous in the face of social disapproval are: 

. Most insightful are: 

. Most faithful in marriage are: 

. Reasoning ability is poorest in: 

. Showing greatest emotional balance in crises are: 

. “Carriers” of disease are more often found among: 

. Least likely to have an opinion of themselves which 
is higher than the opinion of them by their peers are: 

. Most likely to give up a principle in order to obtain 
social status are: 

. Least willing to accept responsibility for actions which 
will affect the happiness of others are: 

. Prone to defend stubbornly their own point of view 
against overwhelming evidence that it is incorrect 
are’ 

. More people report their greatest happiness was given 
them by: 

. The members of social service organizations who 
make the greatest individual contributions of time, 
effort, and results are: 

. Most often likely to resort to unfair tactics in com- 
petitive situations are: 

. If in charge of finances, most likely to go dangerously 
in debt are: 

. Most likely to make pretenses socially rather than to 
reveal their real attitudes are: 


30. Prone to take advantage of any situation to further 
their own selfish ends are: 

31. Individuals who are so dependent upon others that 
they have difficulty adjusting to life situations are 
more often: 

. Possessing the keenest senses of humor are: 

.More often tending to be competitive even in the 
most cooperative situations are: 

. Most able to base actions on objective facts rather 
than on irrational personal feeling are: 

. Most imaginative are: 

. Fundamentally more prone to impose one’s will on 
others are: 

. More courageous in the face of physical danger are: 

.Most prone to punctuality out of consideration for 
others are: 

. Personal conscience fails to inhibit reprehensible be- 
havior most frequently in: 

. Most creative are: 

. More courageous in the face of pain are: 

. The best leaders of small groups—s5 to 10 individuals 
—more often are: 

. General intelligence is highest in: 

. The persons who remain most loyal to friends and 
causes under fire are: 

. Most often swayed against their best interests by news- 
paper editorials are: 

. Physical defects are more common among: 

. Basic moral character is found to represent the 
highest values generally in: 

. Least likely to exaggerate their personal difficulties 
are: 

. Poorest losers at sports are: 

. Trustworthiness has been found to be highest in: 

51. More likely to find satisfaction in daydreams than in 
actual life situations are: 

- More inclined to exhibitionism are: 

53. Having least poise in awkward situations are: 

54. The sexual codes of their own group are more often 
violated by: 

55. Impetus for future social progress will come to the 
greatest degree from: 

56. Most likely to be sensitive to the feelings of others 
are: 

57. “Vanity” is a term applicable to more members of: 

58. “Keeping up with the Joneses” is more often a pre- 
occupation of: 





of approximately 600 words of material which was 
represented as the first page of a recent article in a 
psychological journal. The “article” begins with 
a statement that enough evidence is now available 
to lead to the conclusion that there is a true dif- 
ferential between the sexes strongly favoring the 
male. The results of specific studies, and conclu- 
sions supposedly drawn from many others, are 
brought together to “prove” the superiority of men 
in physical and psychological traits. For example, 
in “physical strength,” “top scores on intelligence 
tests,” “honorableness,” etc. 

2. A strong pro-female argument. The attempt 
was made to make this argument equal in pull to 
the pro-male argument. The material was written 
to resemble the pro-male argument as closely as 
possible with respect to length, amount of evidence 


cited, number of specific studies mentioned, etc. 
The hypothetical investigators mentioned in this 
argument were even matched with those mentioned 
in the pro-male argument as to probable national 
origin of their names. The physical and psycho- 
logical traits discussed were selected so as to corre- 
spond to those discussed in the pro-male argument. 
It was necessary, however, to introduce slight 
nominal differences in order to make plausible the 
simultaneous presentation of both pro-male and 
pro-female arguments in the “debate” and “impar- 
tial presentation” cqnditions (to be described 
below). Thus “physical endurance” was substituted 
in this argument for “strength” which appeared in 
the pro-male argument, “higher average intelli- 
gence” for “top scores on intelligence tests,” “high 
ethical standards” for “honorableness,” etc. 





DererMINERS OF ATTITUDE CHANGE 


3. An “impartial presentation” of both pro-male 
and pro-female arguments. Again the material was 
represented as the first page of a recent article in a 
psychological journal. In this case the author 
started with a statement that there is now evic'snce 
for an evaluation of the roles of the two sexes. 
Then there followed the content of the pro-male 
and the pro-female arguments within the frame- 
work of “on the one hand we find... and on 
the other hand... .” That is, a paragraph from 
the pro-male argument is followed by the corre- 
sponding paragraph from the pro-female argument, 
introduced by a suitable “impartial” connecting 
expression. 
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Procedure 


Each semester the experiment required two 
class meetings. On Friday each S filled out the dis- 
guised attitude “test,” and on the following Mon- 
day each was subjected to one of the five experi- 
mental conditions, following which he again filled 
out the “test.” The Ss were assigned to their ex- 
perimental groups in the process of distributing the 
materials. In the administration of the values 
inventory, Ss were told that the questionnaire was 
originally prepared “for the purposes of comparing 
the opinions of social psychologists with prevailing 
widespread beliefs concerning people and their func- 


TABLE 2 


Descriptive Sratistics—Pretest, Postrest, AND CHANGE ScoRES—OF THE TWENTY EXPERIMENTAL Groups 








1950 SEMESTER 


1951 SEMESTER 





Group WoMEN 


WomEN 





x 


s? x s? 





Controls 
Pre 
Post 
Change 


Pro-Males 
Pre 
Post 
Change 
Pro-Females 
Pre 
Post 
Change 


Debate 
Pre , 80. 
Post . 79. 
Change ‘ 68. 


Impartial 
Pre 155. 
Post 209. 
Change 73. 


103.61 . . 86 
135-27 
46.51 ‘ -62 60.89 


122.81 > 80.25 
169.54 
108.64 


130.09 74-50 
240.22 
131.50 


94.26 
168.25 . 
93-36 . 82.30 60.73 


158.87 
176.50 
64.54 


153.83 


248.60 152.44 


146.99 
264.13 
177.60 


168.50 
96.48 


166.44 
270.16 
166.68 


258.96 
152.31 


193.78 
291.37 


134 
156.38 


167.10 . 85.22 
171.34 152.52 
112.69 . 98.80 





4. The “debate.” This material comprised an 
introductory statement to the effect that two articles 
appeared in different psychological journals at about 
the same time, “and a controversy was on!” There 
then followed the identical material of the pro-male 
and pro-female arguments as the first pages of 
these two articles. Half the Ss had the pro-male 
“article” presented first and half had the pro-female 
“article” presented first.® 

5. The control material. This comprised the first 
page of an essay by J. H. Fabre on the calendar. 


There were more words in the “debate” situation 
than in the “impartial presentation” since certain repeti- 
tions, as of the statements of reasons for writing the 
articles, were not appropriate in the latter condition. 
There seems no reason to expect, however, any relation- 
ship between the kinds of effects and mere passage 
length. 


tioning.” They were told that the pooled opinion 
of the class might “turn out to stand in interesting 
relationship to that of social psychologists.” 

Our interest in the relationship between the argu- 
ments and the retest responses to the value inven- 
tory was effectively masked by interpolating several 
irrelevant tasks between the arguments and the 
inventory on the second day. 


REsULTs 


The main results of this study are set forth 
in the accompanying tables. Table 2 gives 
the N’s, means, and variances of pretest, post- 
test, and change scores for each of the 20 
experimental groups. 

Our primary interest was in the changes, 
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and these data were analyzed by an approxi- 
mate procedure. Means of the 20 subgroups 
were calculated and submitted to an ordinary 
analysis of variance (unweighted means). 
This is an inefficient procedure, utilizing only 
19 of the 814 degrees of freedom available, 
but it was used because of the complexity of 
the application of tests of significance to the 
analysis of variance in the nonorthogonal 
(disproportionate subclass numbers) case. 
The tests of significance must be interpreted 
with caution in any event because of the 
demonstrable inhomogeneity of subclass vari- 
ances. The result of this analysis is presented 
in Table 3.7. None of the interactions is sig- 
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of squares associated with experimental 
conditions. 

The analysis suggests that, over all experi- 
mental conditions, the male Ss differ from 
the female Ss with respect to their changes. 
Males tend to change toward higher scores 
and females toward lower scores. The F for 
sex in Table 2 is significant at the .05 level, 
and if the within-groups mean square were 
used as the estimate of experimental error, it 
would become significant at the .o1 level; 
there can be little doubt that this is a real 
effect. There is a suggestion that the 1950 
Ss differed from the 1951 Ss. Whether the 
triple interaction of Table 3 or the within- 


TABLE 3 


SumMMary oF ANALYsIs OF VARIANCE OF SuBCLASS MEAN CHANGES 








Source or VARIATION dj Sum oF Squares Mean SQuare F;t 








144.95 
35-99 
12.22 


579-79 
35-99 
12.22 
14.64 

6.39 
0.84 


Experimental Conditions 
Sex of Subject 
Semester 
Condition « Sex 
Condition X Semester 
Sex X Semester 
Condition * Sex 
< Semester 4 


4 
1 
1 

4 

4 
I 


4-69 


39.12 


Adjusted Within-Groups 795 


2.74 





+ F, is the value of F using the combined interaction 
the within-groups mean square, 105.09, adjusted so as to 


(harmonic) mean number of cases in the twenty subgroups. 


* o1< p=.05. 
ee p= Ol. 


nixicant, either in comparison with the triple 
interaction or in comparison with the within- 
groups mean square, so all interactions have 
been pooled to yield the error estimate. 
This table shows that the several experi- 
mental treatments gave rise to real differences 
in the changes between pretest and posttest 
performances. Reference to Table 2 reveals 
that for both men and women Ss the pro- 
male and pro-female arguments when pre- 
sented alone result in rather considerable 
mean changes in the direction of those 
arguments. These changes no doubt account 
for the larger proportion of the large sum 


7 As a rough check on the triple interaction used as 
the error variance in this analysis the within-groups 
mean square, based on 795 df and reduced to the 
basis of the mean number of cases in the several experi- 
mental groups, was calculated. The error term used 
was found to be insignificantly larger than this mean 
square, even with the very large df involved. 


mean square as the estimate of error. Fz is the value using 
be comparable with variation among means based on the 


groups mean square is used as an error term, 
this F is significant at the .o5 level and not 
at the .or level. It is to be noted that the 
draft was taking considerably more male 
students in 1951 than in 1950 and it may be 
that these selective influences are represented 
in the male Ss remaining in 1951 and that the 
social consequences of this phenomenon had 
some influence on the female Ss. We are 
unwilling to do more than speculate upon the 
explanation for this finding. 

The question which this study was de- 
signed to investigate concerns the relative 
effectiveness of arguments presenting cases 
for discrepant attitudinal positions in the con- 
text of a debate and in the context of impar- 
tial juxtaposition. To study this question we 
wished to compare the two groups after 
allowance had been made for the possible 
differences between the strengths of the two 
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TABLE 4 


MEAN CHANGE Scores FOR THE SEVERAL ExpEri- 
MENTAL CONDITIONS FOR MEN AND 
Women Susyjects 








Pro- Pro- Im- 


Tro. Mare Femace DEBATE PARTIAL 





—1.36 
—1.83 


2.38 


: 9.86 
—o.81 


7-94 


—6.90 
—8.84 


4-31 
—1.23 





arguments (i.c., the arguments for the two 
attitudinal positions) and for secular trend 
and the effect of having taken the pretest. 
This allowance must be made statistically, 
since it was not otherwise possible to equate 
the two sets of arguments, etc. Since the 
relative number of men and women was not 
constant in the two semesters studied, and 
since the mean changes of men and women 
under control conditions were significantly 
different (though only barely), it seemed 
better to examine these corrected-mean 
changes separately for men and for women 
than to make a single over-all correction. 
The data necessary to these comparisons are 
set forth in Table 4, and the corrected means 
in Table 5. 

We took as an estimate of the magnitude 
of change to be expected as a result of having 
taken the test three days earlier, the mean 
change observed in the appropriate control 
group. As an estimate of the magnitude of 
the difference in strength of the two argu- 
ments, we took the algebraic sum of the mean 
changes of the pro-male and the pro-female 
groups after both of these had been corrected 
for secular trend. If these two arguments 
had been equal and opposite in pull, the 
algebraic sum of the mean changes which 
they induced should be zero. Thus, for either 
sex, the mean change for the debate situation, 
corrected for trend and for differential pull 


TABLE 5 


MEANS FoR DEBATE AND IMPARTIAL CoNDITIONS 
ApjusTep For TREND AND DIFFERENCE IN 
Putt oF THE Two ARGUMENTS 








SuByEcTS DEBATE IMPARTIAL 





Men 


Women 


3-73 
—1.14 


—1.94 
—1.74 





of the arguments, is obtained by subtracting 
from the mean change of the debate group 
the sum of the mean changes for the pro- 
male and pro-female groups, diminished by 
the mean change of the control group. For 
the males the corrected mean change under 
the debate situation is thus seen to be: 4.31— 
(—6.90-+-9.86 —2.38) = 4.31 —0.58= 3.73. 

We have estimated the standard error of 
these corrected means to be about 2.5 for the 
male Ss and about 2.1 for the female Ss.* 

From Table 5 it will be noted that in gen- 
eral these corrected means are not sufficiently 
different from zero to raise any justifiable 
suspicion that, when corrected for the differ- 
ance in the separate effectiveness of the two 
arguments and for the influence on the 
second test of having taken the test before, 
the debate or the reasonable mode of pres- 
entation makes any substantial contribution 
to the effectiveness of the arguments. The 
largest (absolutely) of the four corrected 
means is that for the change of the men under 
the debate conditions; this about one and 
one-half times its standard error. 

Reference has been made above to the 
demonstrable inhomogeneity of the sub- 
group variances. Bartlett’s test applied to 


8 The standard errors of these corrected mean changes 
are readily derivable from first principles. The cor- 
rected mean is a special case of an algebraic sum of 
terms, cach of which is subject to error. In our case 
the several terms are mutually independent. Thus the 
sampling variance of our corrected mean is equal simply 
to the sum of the sampling variances of the four terms 
required in arriving at the corrected mean. In view 
of the relatively large number of uncertain quantities 
entering into the sum and the relatively small number 
af cases contributing to each, we would not wish the 
standard error of such corrected means to be forced to 
carry too large a burden of interpretation, and keeping 
in mind the demonstrable inhomogeneity of the subgroup 
variances, we may take as an approximation to the 
standard error of the corrected means for the males the 
product of the within-groups mean square (a kind of 
average subgroup variance) by the sum of the reciprocals 
of the N’s of the four groups involved, the debate (or 
reasonable) group, the pro-male group, the pro-female 
group, and the control group, all for male Ss. This 
yields a value of about 2.5 for the standard errors of the 
corrected means for male Ss. Since there were more 
female Ss than males, the corresponding standard error 
for the females is about 2.1. 

Solomon (6) has called attention to a similar scheme 
for correcting means for several kinds of disturbing 
influences and indicates that the standard error of his 
corrected mean is unknown. His situation is compli- 
cated slightly by a dependence among the several terms 
employed in the correction, but unless the mathematical 
problem is more involved than it seems to us, the 
method indicated above should yield a useful approxi- 
mation to the standard error which we require. 
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these data yields a chi square of over 80 with 
19 degrees of freedom, a highly significant 
value. It was thought of some interest to see 
whether we could make some sense of the 
nature of the inhomogeneity discovered. 
This was done by (a) transforming the sub- 
group variances to yield a variable in which 
mean, and variance were independent, and 
(4) performing an analysis of variance on 
these data. This analysis suggests that the 
chief contributor to the inhomogeneity of the 
variances was the experimental conditions— 
ic., the two direct arguments in general 
yielded larger variances of change scores 
than did the debate, reasonable, or control 
conditions. 


Discussion 


These results are in interesting relationship 
to our original hypotheses. We find the 
direct arguments for one definite position, 
whether pro-male or pro-female, having the 
effect of moving groups of Ss, both men and 
women, in the direction of those arguments. 
It had been predicted that Ss who were rela- 
tively uninvolved in the attitude in question 
would be influenced in this manner, while 
those who were involved might well resist 
such change, but that the net effect would be 
in the direction observed. The existence of 
some resistance to change within the groups 
is suggested by the differential effects of each 
argument on the two sex groups, and is fur- 
ther supported by the consistent shifts of the 
control groups in the direction of valuing 
more highly their own sex group (Tables 3 
and 4). The fact that the arguments pro- 
duced resistance in some Ss and facilitation 
in others is one possible explanation of the 
greater variances of the change scores in those 
groups subjected to direct arguments as com- 
pared with the other conditions. 

There are no clearly significant changes 
which occur when Ss are exposed to opposed 
arguments of approximately equal strength, 
and approximately equidistant from the neu- 
tral position for the attitude. This appears to 
hold whether the arguments are presented in 
a controversial, debate context, or in an 
atmosphere of objectivity and impartiality by 
one writer. Nonetheless, the direction and 
magnitudes of the net shifts occasioned by 
these contexts are in interesting relation to 
our original hypotheses. 


It will be recalled that we had predicted 
that for the. debate there would result a net 
shift in the direction of the existing bias of 
the Ss, and for the condition of impartiality 
there would result a moderating effect—a 
shift toward neutrality. For the men Ss we 
find behavior trends which are in accord 
with these predictions. With an original 
position favoring males over females, these 
Ss under debate conditions (after correction 
for trend and difference in pull of the two 
arguments) moved farther in the direction 
of favoring males. Under impartial presen- 
tation they shifted in the pro-female direction, 
to a more moderate position. 

For the women, interprethtion of trend is 
confused by the fact that thqse Ss, too, began 
with an attitude favoring the male. If one 
considers them simply as 4nother pro-male 
group, then they shifted fin the expected 
direction under the impartial condition, but 
not under debate. If, on tht other hand, one 
places emphasis on the cbnsideration that 
these Ss are in fact women, regardless of their 
pretest position, then a variety of explanations 
come to mind by which one can explain the 
findings for both contexts. (Unfortunately, 
explanations also become possible by which 
one can explain any behavior that might have 
resulted.) 

If further research should substantiate a 
generalization that for attitude change, de- 
bates and impartial discussions are equally 
impotent, and that straightforward propa- 
ganda in the lay sense is most effective, then 
this would have sobering implications. How- 
ever, the results of this investigation justify 
no such generalization. In this study, for 
example, we are dealing with but one atti- 
tude, with a limited set of written arguments, 
with the arguments far apart and approxi- 
mately equa! in strength, a writer unidenti- 
fied, except for the implication that he is a 
psychologist, with measures of change in 
degree but with no measure of change in 
stability of attitude position, and taking 
account of only the most immediate effects. 

Relationships requiring investigation in- 
clude four on which we have research now in 
progress. The first has to do with the ques- 
tion suggested earlier concerning the differ- 
ential effects of arguments as dependent on 
the degree of investment by Ss in maintain- 





DererMiners oF AtrirupeE CHANGE 41 


ing their attitude position. Our group effects 
may hide important individual differences. 
A hint that this may be so comes from the 
larger variances in the changes resulting from 
the presentation of single sets of arguments 
as compared with those resulting from the 
other conditions, including the control. 

A second question relates to the significance 
of the distance between the previously held 
position in regard to an attitude and the posi- 
tion urged by a set of arguments. The data 
presented in this article result from contacts 
by Ss with only extreme arguments on one 
or both sides. 

A third question is concerned with the 
effects of different contexts on the perma- 
nence of attitude change. Our measures of 
attitude change were all immediate. The 
posttest followed the stimulus material by 
approximately ten minutes. 

A fourth question deals with the relative 
effects of emotional and intellectual argu- 
ments upon such attitude shifts. It is par- 
ticularly interesting to speculate as to whether 
men and women are equally responsive to 
arguments of both sorts. 


SUMMARY 


As part of a program of research on the 
dynamics of attitude change, the attempt was 
made to influence an attitude (toward the 
relative “value” of men and women in our 
culture) by arguments presented in three 
contexts: the context of an exposition frankly 
in favor of a particular attitudinal position, 
the context of a debate between apologists of 
opposing attitudinal positions, and the con- 
text of an impartial presentation of the two 
incompatible arguments. Men and women 
Ss varied in their responses to these condi- 
tions, thus confounding the interpretation of 
the findings, but the results, though not in 


each case statistically significant, are not in- 
consistent with our hypotheses concerning the 
nature of attitude change. These hypotheses 
imply, in our view, (a) that under direct 
argument, groups of Ss should change in the 
direction of the argument. This implication 
was verified by our data, as it has often been 
before. (4) That in the debate situation, Ss 
would move in the direction of their pre- 
existing bias, the amount of this change 
depending upon the degree of emotional in- 
volvement in the attitude. The first part of 
this implication was confirmed for the male 
Ss but remains equivocal for the females 
because of the not-completely-unexpected but 
rather surprising pro-male initial position of 
the female Ss. Lack of a satisfactory measure 
of emotional involvement leaves us without a 
test of the second part of this implication, 
namely, (c) that in the “impartial presenta- 
tion” context Ss would become more mod- 
erate in their attudinal position. This was 
also verified for the males, but for the reason 
given above the results for the female Ss are 
hard to interpret. 
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education and guidance of youth have 
increasingly shifted their attention to the 
“normal” individual; they have more and 
more focused their concern on understanding 
the process of child socialization and the 
problems boys and girls face in becoming 
adjusted to their cultural milieu. As a part 
of this concerted effort, the concept of de- 
velopmental tasks has been emphasized. 
These are the common major tasks—certain 
learnings, adjustments, achievements—that 
face all individuals in a given society or sub- 
group of society; they are the modes of cul- 
tural adjustment expected of an individual 
and rewarded by his society. Since this con- 
cept attempts to interrelate the physical, 
social, and personality learning of the indi- 
vidual, it seems to offer -both a basis for 
integrating the widely divergent findings on 
human develcpment and an inclusive frame- 
work for the study of further problems which 
must be solved before there is a clear under- 
standing of the process of child socialization 
and hence of the total educative process. 
Havighurst’s work (2), which elaborates 
this concept, is a basic reference on develop- 
mental tasks in general, and particularly in 
our culture. A recent monograph by Rab- 
ban (5) provides a summary of much 
pertinent literature and excellent related 
background material describing the study of 
one task (sex role) with a younger age group. 
Some developmental tasks at various age 
levels have heen identified, but the problems 
of helping children directly in the achieve- 
ment of these tasks have yet to be solved. 
The present study was conducted in order to 
learn more about five developmental tasks of 
adolescence:* (a) learning an appropriate sex 


I’ the past decade those interested in the 


LAlthough only five tasks were studied, overt and 
covert aspects of the first two were considered separately. 
These five tasks have been chosen only because limita- 
tions of the data prohibited valid consideration of 
others. However, on the delineation of these par- 
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role (overt and covert); (4) achieving emo- 
tional independence of parents and other 
adults (overt and covert); (c) developing 
conscience, morality, and a set of values; 
(d) getting along with age-mates; and (e) 
developing intellectual skills. 

The study sought to further knowledge 
that may answer the fundamental questions: 
What personal characteristics are related, and 
in what patterns are they related, to each of 
the five developmental tasks that are being 
studied? What has been the genesis and 
developmental record of these characteristics 
in the life history of the individual? Such 
information may offer some insights into 
what adults may do to help boys and girls 
working on these tasks, and to determine the 
kinds of experiences young people need to 
assist them to take the next steps in growing 
up. It is possible that some educational and 
cultural influences may be determined which 
may be consciously manipulated to facilitate 
desired types of personal growth. 

The present study has two parts. The first, 
quantitative, will present the findings of the 
statistical analyses to determine factors and 
constellations of factors significantly related 
to good achievement of the five develop- 
mental tasks. The second, qualitative, will 
describe the genesis of these variables and 
trace their development briefly in the life 
history of certain uniquely achieving individ- 
uals. It is hoped that this study will further 
the systematic understanding of the social 
learning that takes place in child socialization 
in our culture. 


Pros_=M 


The general question toward which the 
statistical analyses of the data were directed is: 





ticular tasks there is rather general agreement; while 
varying somewhat in statement, they are included in 
some form in all narrations of developmental tasks of 
adolescence. 
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What are some personal characteristics that 
are related to successful accomplishment of 
each of the developmental tasks being 
studied? More specifically, the primary ques- 
tions to which answers were sought by this 
quantitative analysis of trait ratings in rela- 
tion to task ratings are: What are some 
significant factors characteristic of sixteen- 
year-old adolescents who have successfully 
accomplished each of these specific develop- 
mental tasks? Are these different from fac- 
tors characteristic of those who have failed 
at the task? What factors may be unique to 
each sex for accomplishment of each task? 
What factors may be common to the sexes 
for accomplishment of each task? What are 
the patterns of the significant factors—for 
each sex, for each task, and for the tasks in 
general? 

It is recognized that the statistical findings 
identifying group norms will not tell the 
entire story. In fact, quantitative analysis 
prompts the following additional questions: 
How do those characteristics found to be 
desirable originate and develop in individ- 
uals? What determines the level of efficiency 
with which an individual accomplishes the 


tasks of a particular age? 


METHOD 
Subjects 


The subjects (Ss) were 15 boys and 15 girls stud- 
ied intensively as a part of the Midwest Community 
Study Research Project, an elaborate interdiscipli- 
nary research program carried on since 1942 by the 
staff of the Committee on Human Development of 
the University of Chicago for the purpose of investi- 
gating character and personality development dur- 
ing later childhood and adolescence in a typical 
Midwestern community (Midwest). The Ss were 
selected from the original sample of children chosen, 
which consisted of the 115 children who had been 
born in 1932 and who in 1942 resided in Midwest 
and its surrounding rural territory. The selection 
of children to be studied intensively was made on 
the basis of being “adjusted” or “unadjusted,” as 
defined by arbitrarily determined composite criteria. 
In class-status rankings one boy is upper-middle 
class, 12 boys and 13 girls are either lower-middle 
class or upper-lower class, and 2 of each sex are 
lower-lower class. 

The cases chosen for intensive analyses from the 
30 Ss of this study were selected because they rep- 
resented the extremes in socialization patterns in the 
total sample, and should, therefore, show the most 
clear-cut contrasts. That boy and that girl whose 
composite ranking in levels of achievement for all 
the tasks at age 16 totalled the highest were studied 


intensively as examples of well socialized adoles- 
cents; that boy and that girl whose composite rank- 
ing in levels of achievement for all the tasks at 
age 16 totalled the lowest were studied intensively 
as examples of poorly socialized adolescents. A boy 
and a girl who at age 16 ranked in the lowest 
quarter on some tasks and in the highest quarter 
on others were selected for intensive study as sam- 
ples of incongruently socialized individuals. 


Social Setting 


Because a major assumption of the study im- 
plies that each society educates its children in its 
own particular manner, the social environment in 
which a boy or girl grows up assumes much im- 
portance. Midwest is a county-seat town with a 
population of 6,000, and with an additional 4,000 
living in the surrounding trade area, The city is 
within 100 miles of a metropolis. Agriculture, 
manufacturing, and retail sales are the chief sources 
of income. Approximately 90 per cent of the city’s 
inhabitants are native-born. Midwest is small enough 
so that a boy or girl can grow up and know 
“almost everybody in town,” yet it is complex 
enough to orient him to modern industry and 
society. 


Data? 


As a part of the Research Project, rich and 
extensive case-study materials were collected on this 
small group of Ss. The following basic data con- 
sisted of three kinds; the last two were included 
in the first, but since they are used independently 
of it, they are categorized separately. 

1. Extensive materials covering the life history 
of each of the 30 Ss from ages 10 to 16 were 
obtained by the staff of the Project. These repre- 
sented the results of 32 test instruments such as the 
Wechsler-Bellevue Intelligence Test, Minnesota Paper 
Form Board Test, Thematic Apperception Test; and 
10 instruments, such as Strength of Conscience Ques- 
tionnaire and Family Relations Questionnaire, de- 
veloped especially for the Research Project. There is 
also information in the following areas on each S: 
interviews; psychometric data; Ss’ reports via check 
lists and free responses; various ratings by acquaint- 
ances, teachers, and others; projective techniques; 
sociometric data; physical data; Clinical Conference 
and Moral Character Conference summaries and 
reports.® 

2. A minimum of eight Research Staff members, 
each of whom was well trained in the theory, tech- 
niques, and skills of one or more of the disciplines 
which the Research Project incorporated, made in- 
dividual ratings for each of the Ss on 47 personality 


2 Many individual Research staff members contributed 
to the collection and analysis of the data. Appreciation 
is expressed to them, without whose efforts this study 
would not have been possible. 

8 The Clinical Case Conference studied the data 
collected on this group from 1945 to 1947; it was 
followed by the Moral Character Conference which 
integrated all the earlier data and their interpretation 
by the earlier conference into its research. 
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and social role items at ages 10, 13, and 16; mean 
ratings for cach S were then determined and used 
in the study. For the most part, the ratings are 
on a 10-point scale with a 5.5 midpoint; specific 
scaling criteria are elaborated in the Trait Defini- 
tions Manual.* 

3. Records on academic achievement as measured 
by both the Metropolitan Achievement Tests and 
school marks for the school years ending in 1943 
and 1946 and by school marks for the school year 
ending in 1949. From these, decile rankings were 
made for rating the Ss on the fifth developmental 
task, “Developing Intellectual Skills.” 

The completed file foider for each S furnished ex- 
haustive interdisciplinary data that could not have 
been collected by an individual investigator. 


TREATMENT OF THE Data 
Quantitative Analysis 

It was hypothesized that, since most tasks 
are defined differently for each sex in terms 
of the culture pattern, it was logical to predict 
that different variables might be related to 
successful achievement of a task by each sex. 
Therefore, the factors and clusters of factors 
significant for achievement of the tasks were 
determined separately for the sexes. 

To attempt to ascertain factors that make 
for satisfactory accomplishment of the tasks 
being studied, the probability of a significant 
relationship between the items of the Trait 
Rating List as rated at age 16 and the rating 
on each task at that age was determined. 
This was done for the items rated on a 10- 
point scale by recording all the ratings on 
cards and sorting these separately for the 
sexes to determine the frequencies in a con- 
tingency table, using as the point of dichot- 
omy in the y variable ‘he median rating of 
the sex on the task, and as the point of 
dichotomy in the x variable the median rat- 
ing of the sex on the trait, with one degree 
of freedom in the tabie. An example of this 
procedure is given below: 

High Low 
(Above median (Below median 
rating on Item rating on Item 
4 Insight) 4 Insight) 
Above median rating 

on getting along 

with age-mates 

(outer) 

Below median rating 

on getting along 

with age-mates 

(outer) 2 6 


*To save printing costs, the Trait Definitions Manual, 
which contains the “Trait Rating List” and the defini- 


The significance level of these observed fre- 
quencies in relation to the theoretical fre- 
quencies was read directly from a table of 
significance levels recently constructed by 
Finney (1). This table has significance levels 
computed by the Fisher and Yates exact 
method for evaluation of the probability 
when small frequencies make the usual x? 
approximation unreliable. The object of this 
test is to determine whether the proportions 
of highs in the trait observed in the two 
groupings by relative achievement on the task 
differ more widely than might reasonably be 
expected if the population’s values of these 
proportions are equal. For example, when a 
significance level of .o5 is obtained for the 
2X2 contingency table illustration given 
above, one can be reasonably sure that the 
proportions observed show a trend that is 
not due to sampling error. There is propor- 
tionately greater assurance of this when the 
significance level is .or instead of .05, and 
still greater assurance if the significance level 
is .oor instead of .or. 

When it appeared from inspection of the 
distributions in the contingency tables that 
items not significant at the .o5 level in 
Finney’s table might be significant between 
the .o5 and the .20 levels, x* was calculated 
by the Fisher-Yates method. If this yielded 
a p value of .20 or less, the factor was retained 
as one which tended to be related to accom- 
plishment of a task. It is recognized that the 
less significant p values are only indicative of 
a trend and more careful, precise study with 
larger groups of Ss will be needed actually 
to substantiate their significance. 

For those parts of the Trait Rating List 
which were rated by apportioning a certain 
number of points among the possible items 
in order to represent their relative strengths 
in the individual, a different treatment of the 
data was needed. For “Sources of Morality 
Systems” and “Primary Identifications,” in- 
spection of the ratings showed that practically 
all points were assigned to either the mother 
or the father; thus, for these two parts of the 
List, the contrasting groups were compared 





tions of all items on it, has been deposited with the 
American Documentation Institute. Order Document 
No. 3714 from American Documentation Institute, 1719 
N St. N.W., Washington 6, D. C., remitting $1.co for 
35-mm. microfilm or $1.00 for 6 by 8 in. photocopies. 
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on these ratings by the x” method, using the 
following classification: 


Father- Mother- 
dominated dominated 


Above median rating on 
task 

Below median rating on 
task 


The other part of the List rated differently 
was “Social Roles.” In this case a rating was 
given if there was evidence for rating a per- 
son on a role; if there was no evidence, a 
rating of 1 was made. Since all roles were 
defined as having the character of being a 
group-perceived function of the person in the 
total original group, it was thought fitting to 
dichotomize the x variable (the social role) 
by arbitrarily rating as High those with a 
mean rating of 5.5 or over, and Low those 
under that figure. This seemed feasible 
inasmuch as ratings were made on the basis 
of only this group, not the total American 
adolescent population. 

From these sets of computations, lists of 
factors, significant at the .05 level or less and 
at the .20 level or less, that tend to cluster for 
each sex with each of the tasks were deter- 
mined. Patterns of the relationship of a fac- 
tor with the other variables in clusters about 
other tasks were also observed for each sex. 
The level of the p value was considered a 
cut-off point in making clusters of the items; 
those having the same p value were grouped 
together and studied in total relationship to 
accomplishment of the tasks. Thus, the fac- 
tors of greatest over-all importance for a sex 
for the achievement of these tasks at age 16 
were determined and the decreasing discrim- 
inatory importance of them was also sur- 
veyed. The number of tasks to which they 
were related and the strength of the relation- 
ship were also considered. 

In the interest of conciseness, variables sig- 
nificant at less than the .10 level are omitted 
from the tables in this report, but are stated 
in Schoeppe’s dissertation (6), and may be 
secured upon request. Although the .05 level 
is assumed as satisfactorily significant for the 
study, those factors significant at the .10 level 
are included in the tables because they so 
consistently reappear in such a clear pattern 
that they seem definitely to suggest the exist- 
ence of meaningful relationships in the data. 


Qualitative Analysis 

It is important to study individual patterns, 
for achievement by individuals on develop- 
mental tasks may be comparable, but the 
constellations of responsible factors often dif- 
fer widely. Some single aspect of personality 
may be selected for minute examination but 
the end is not a conglomeration of additive 
traits but a functioning totality, and an 
understanding of the whole is a prerequisite 
to a proper evaluation of any selected detail. 
Lewin, in pointing out shortcomings in using 
solely the statistical method, said: 

The very relation that is decisive for the investi- 
gation of dynamics—namely that of the actual in- 
dividual child in the actual, concrete, total situation 
—is thereby abstracted. . . . The concept of the 
average child and the average situation are abstrac- 
tions that have no utility for the investigation of dy- 
namics. Thus the environmental researches become, 
in general, the more fruitful the more attention is 
paid to a comprehension of the concrete total situa- 
tion instead of to the number of cases (4, p. 68). 


In studying forces operant in an individual 
in a developmental period, it is important to 
accumulate information about group tend- 
encies, but this alone is not sufficient if the 
dynamics operating within an individual in 
the group are to be understood. The con- 
stant interaction between an individual and 
his environment, as well as the influence of 
his past experiences upon present behavior, 
cannot be ignored. As Healy has aptly indi- 
cated, “The knowledge of personality for any 
theoretical or for any practical purpose must 
be obtained through utilizing the technique 
of historical perspective in the study of indi- 
vidual persons” (3, p. 30). 

For such reasons, the life histories of the 
six most uniquely achieving Ss were inten- 
sively studied in an attempt to depict the 
dynamics of the behavior of these 16-year-olds 
in our Midwestern American culture. The 
case studies were made by first checking for 
the number of variables which were found to 
be significant for the group for achievement 
on a task or tasks on which the S rated high 
or low, and likewise on the clusters and pat- 
terns on these items in his ratings. Then, 
through use of the original instruments, their 
analyses, and interpretations of these in Clini- 
cal Conference Summaries Numbers 1 and 2 
and Case Reports Numbers 1 and 2 made by 
the Moral Character Conference, an attempt 
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was made to discover the genesis of each 
factor in the personality, and the environ- 
mental and organismic conditions which nur- 
tured its maturation. 

To describe these adolescents one needs to 
view them as thinking, feeling, doing 16-year- 
olds, One needs to examine the psycholoyi- 
cal, social, and biological uniqueness of each 
individual and the interplay of these factors. 
In order to understand the individual’s 
present socialization, one needs to trace the 
etiology in each of these areas. One must 
look at the family picture, his physical de- 
velopment, his personality structure, his social 
relationships, the life trends through time and 
his achievement on the developmental tasks, 
and the particular interplay of factors that 
serves to cause a particular socialization pat- 
tern. For the sake of brevity detailed sum- 
maries of these case studies are not made in 
this paper,” but they are sketched here and 
the underlying hypotheses and some implica- 
tions of these are discussed. | 


REsuLts 


Quantitative Analysis 


By the nature of the data the factors de- 
lineated can only be stated to be related to 
the accomplishment of these developmental 
tasks at age 16 by these 30 adolescent boys and 


girls. However, on the basis of the longi- 
tudinal relationships on levels of achievement, 
which have been studied and reported (7), it 
should also be possible to draw some con- 
clusions with a fair degree of certainty for 
other ages. Whether these conclusions are 
valid for other adolescents or for most adoles- 
cents in our culture depends on their verifi- 
cation by comparable studies on other groups 
of “normal” adolescents. 

The variables that may have been impor- 
tant for the achievement of each task by these 
16-year-old boys and girls in Midwest are 
delineated in the seven tables in this paper. 
They show the relative importance of these 
factors and clusters of factors, and also some 
contrasts between the achievers on the tasks 
and those who did less well on them. 

Those factors found to be significant for 
each sex for achievement of the Age-Mates 
Task are presented in Table 1. To provide 


5 The complete case studies are included in Schoeppe’s 
dissertation (6). 


an illustration, the several major variables are 
discussed. Subsequently, the variables signifi- 
cant for each of the other tasks are presented 
in tabular form; in the interest of conciseness 
they are not discussed, but a summary of the 
general findings from this part of the study 
is substituted. 

Four variables are common to the sexes as 
important in good peer relations for these 


TABLE 1 


Variastes SIGNIFICANT FoR AcGe-Mates Task 








SIGNIFICANCE 
LeveL 


VARIABLE Mare Femae* 





Autonomy -001 -O1 
Acute, accurate observation of human 
behavior -O1 -O1 
Attitude toward same-sex peers—pos- 
itive outer feeling tone , -O1 
Rationality of behavior : -O8 
Sociocentric concern—identifies with 
group goals 
Realistic assignment of responsibility 
Realistic, balanced self-perception 
Rational-internalized, socialized mo- 
tivation 
Introspectiveness (inner) 
Attitude toward opposite-sex peers— 
Positive outer feeling tone 
Internal consistency of values (inner) 
Trustfulness (outer) 
Optimism (mainly inner) 
Creativity (outer) 
Functioning intelligence 
Insight 
Guilt—inner impulse 
Severity of parental control (outer) 
Supporter 





® Parentheses indicate negative relationship (high on 
the task and low on the trait). 


16-year-olds: acute and accurate observation 
of human behavior, rationality of behavior, a 
friendly feeling tone toward the same-sex 
peers, and autonomy. 

For the boys, independent and self-directed 
behavior, or autonomy, is essential. Factors 
unique to the boys center around a socio- 
centric orientation, introspectiveness to under- 
stand self, ego-differentiation of self and 
others, an accurate perception of “who does 
what,” and socialized motivations that have 
been rationally internalized. These factors 
point to a sincerely altruistic and fair person. 
The youth has a friendly feeling tone toward 
his same-sex peers, but his feeling toward 
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girls seems not to be crucial for good peer 
relations at this age. 

Factors unique for the girls are of a some- 
what different nature; many of them appear 
to explain a subdivision of the task which 
might be named “Getting Along with Oppo- 
site-Sex Age-Mates.” ‘The factors significant 
for the girls are not only important in hetero- 
sexual relations, but are essential in fostering 
them. These factors seem to support cultural 


TABLE 2 


VARIABLES SIGNIFICANT FOR EMOTIONAL 
INDEPENDENCE (OuTER) Task 








SIGNIFICANCE 
LeveL 


VARIABLE Mace Femare*® 





Insight .O1 .05 
Source of morality systems—same-sex 

parent 
Primary identification—same-sex par- 

ent 
Attitude toward father as authority 

—positive inner feeling 
Situationally - appropriate 

reactivity (inner) 
Acute, accurate observation of human 

behavior — -O1 
Attitude toward same-sex peers— 

friendly outer feeling tone mer .O1 
Attitude toward opposite-sex peers— 

friendly outer feeling tone ee .O1 
Autonomy coe -O1 
Rationality of behavior — .O1 
Internal consistency of values (inner) ... .O1 
Trustfulness (outer) ind -O1 
Optimism (mainly inner) _~ OI 
Creativity (outer) is .O1 
Functioning intelligence a .05 
Supporter res -10 
Guilt—inner impulse (.10) 
Severity of parental control (outer) (.10) 


emotional 





* Parentheses indicate negative relationship (high on 
the task and low on the wait). 


demands for the girl subtly to structure the 
environment to her own ends, inasmuch as 
the culture pattern does not condone her 
openly aggressing to do so. The 16-year-old 
girl successful in peer relations has positive, 
consistent life aims and values and expresses 
these creatively in action. 

Tables 2 through 7 point up the great dif- 
ferences in discriminating factors for the sexes 
and suggest the importance of cultural deter- 
minants in such differences. The great varia- 
bility in crucial factors for the sexes is one of 


TABLE 3 


VaRIABLES SIGNIFICANT FOR EMOTIONAL 
INDEPENDENCE (INNER) Task 








SIGNIFICANCE 
LeveL 
FEMALE 


VARIABLE MALE 





Insight -O1 .05 

Sources of morality systems—same- 
sex parent 

Primary identification—same-sex par- 
ent 

Attitude toward father as authority 
—positive inner feeling 

Situationally-appropriate emotional re- 
activity (inner) 

Acute, accurate observation of human 
behavior 

Impulse behavior identity—overt be- 
havior direct expression of spon- 
taneous inner impulse 

Autonomy 

Rationality of behavior 

Introspectiveness (inner) 

Creativity (outer) 

Functioning intelligence 





the most clear-cut findings of the study; it 
points up general differences in socializing 
adolescent boys and girls and suggests how 
our culture promotes these differences. 

Boys, it seems, are permitted in our culture 
much more freedom in the expression of 
their emotionality and sexuality. The code 
demands conformity from girls; they must 
express their feelings only in culturally ap- 


TABLE 4 


VARIABLES SIGNIFICANT FoR Sex Rote (Outer) Task 








SIGNIFICANCE 
Leve 


VARIABLE Mare Femare® 





Situationally - appropriate emotional 

reactivity (inner) 
Primary identification—same-sex par- 

ent 
Sources of morality systems—same- 

sex parent 
Attitude toward opposite-sex peers— 

outer feeling tone ave -0O1 
Attitude toward mother as authority 

—positive inner feeling ee" -O1 
Attitude toward father authority— 

negative feeling (inner) (.10) 
Guilt—inner impulse (.10) 





* Parentheses indicate negative relationship (high on 
the task and low on the trait). 
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TABLE 5 


VartaBes SIGNIFICANT For Sex Rote (INNER) Task 








SIGNIFICANCE 
Levet 
MALE 


VARIABLE Femace*® 





Situationally - appropriate emotional 

reactivity (inner) 
Primary identification—same-sex par- 

ent 
Sources of morality systems—same- 

sex parent 
Attitude toward opposite-sex peers— 

friendly inner feeling ée~ .O1 
Internal consistency of values (inner) box .O1 
Creativity (outer) eos -O1 
Attitude toward same-sex peers— 

friendly inner feeling ene +05 
Attitude toward father authority— 

negative feeling (inner) (.10) 
Guilt—inner impulse (.10) 
Severity of parental control (outer) (.10) 





® Parentheses indicate negative relationship (high on 
the task and low on the trait). 


proved ways and, characteristically to date, 
the culture has rewarded suppression of emo- 
tions, especially many positive ones. It has 
demanded that girls retain “little lady” char- 
acteristics while at the same time taking on 
a very different social role. In order to do 
this, girls are much more prudent and cir- 
cumspect in working through the tasks, and 
they attack them with a more thoughtful, 


TABLE 6 


Variastes SIGNIFICANT For CoNnscIENcE, Morats, 
Vatues Task 








SIGNIFICANCE 
Levet 
FEMALE 


VARIABLE MALE 





Realistic assignment of responsibility .o1 -O1 
Realistic balanced self-perception -O1 -O1 
Rational-internalized, socialized moti- 

vation OF Or 
Emotional stability (outer) -O1 .O1 
Acute, accurate observation of human 

behavior -O1 
Sociocentric locus of concern—identi- 

fies with group goals .O1 
Autonomy -O1 
Introspectiveness (inner) -O1 
Conformity to external code 
Potential intelligence 
Superego strength (inner) 
Functioning intelligence 
Insight 





calculated approach. Girls seem to be forced 
by the culture to grow up more by evolution; 
boys more by revolution. 

It is hypothesized that these cultural pres- 
sures on girls growing up may account for 
the results showing a residue of covert feel- 
ings which the girls are never given an 
opportunity to express. The whole question 
of the undesirability of emotional expression, 
which the American culture has tended to 
promote, has wide ramifications for both 
sexes, but it appears to be particularly impor- 
tant for girls. The apparent congruence of 
levels of overt and covert emotional independ- 
ence, but the lack of this same congruence in 
connection with the achievement of an ap- 
propriate inner (covert) and outer (overt) 


TABLE 7 


VARIABLES SIGNIFICANT FOR ACHIEVEMENT OF 
INTELLECTUAL SkiLts Task 








SIGNIFICANCE 
Leve. 
FEMALE 


VARIABLE Mae 





Functioning intelligence -O1 +05 
Potential intelligence -O1 coe 
Internal consistency of values (inner) .O1 
Creativity (outer) -O1 
Optimism (mainiy inner) -05 
Attitude toward same-sex peers—posi- 

tive outer feeling tone 
Emotional stability (outer) 





sex role, especially clearly shown for girls, is 
food for thought and re-evaluation of the 
cultural mores that confuse sex roles and curb 
socialization. 

The importance of autonomy in both sexes 
for satisfactory peer relations is shown; its 
somewhat greater importance for the boys 
stands out in this task in contrast to its im- 
portance for the girls in achieving emotional 
maturity. One also notes the need to have 
an active, participating role in the peer group 
if one is to have good relations with it. The 
findings reiterate the importance of good peer 
relations as being closely tied to all other de- 
velopmental tasks, and also show that good 
peer relations seem to have primarily an 
affect base. 

Within the limitations of the data, the 
importance of proper identification with the 
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same-sex parent highlights the entire analysis; 
this of course assumes that this same-sex 
parent has learned his proper sex role.’ The 
great importance of acute and accurate ob- 
servation of others’ behavior and insight into 
it is also revealed. The noting of a friendly 
attitude toward the same-sex peers as an im- 
portant discriminatory factor again empha- 
sizes the importance of proper identification 
patterns. 

On almost all tasks, emotional features, 
self-understanding, rational socialized moti- 
vation, and self-directed rational behavior are 
much more discriminatory than the level of 
functioning intelligence, although it fre- 
quently appears at the .05 level of significance. 
A high level of functioning intelligence, in 
turn, is dependent upon an absence or mini- 
mum of emotional blocks. The frequency of 
insight is also a highly relevant factor. It 
seems that more emphasis needs to be put 
on developing affective characteristics and 
learning personal and social adjustment, and 
less attention paid to intellectual ability 
and accomplishment per se, if one is to be 
well socialized in our culture. The impor- 
tance of emotional conditioning appears re- 
peatedly. This point again raises the vital 
issue of how to condition the child so he can 
express his emotions in ways approved by 
society, and a consideration of what existing 
societal mores may be stumbling blocks. 

Worth special note are those factors which 
strikingly differentiate achievers from non- 
achievers (i.c., p value of .oo1), and hence 
make a significant difference for these Ss. 
For girls, friendly outer feeling tone toward 
boys is essential for the sex role (outer) task; 
again cultural forces seem to be operant, 
since the reverse is not found. Outer con- 
formity to the external code is essential for 
girls for the conscience, morals, values task; 
the importance of this variable is emphasized 
by the fact that for no other task (for either 
sex) is this factor significant at more than the 
20 level. For boys, adoption of morality 
systems of same-sex parent is crucial for 
achieving both inner and outer emotional 
independence; and ego-directiveness is sig- 
nificant at .oor level of significance for boys 
who get along well with age-mates. These 

¢ All girls in the sample are identified with same-sex 


parent; there are seven boys whose primary identifica- 
tions are with the opposite-sex parent. 


conditions all focus on cultural pressures that 
make for sexual differences and reinforce the 
same pattern of differences, bending the girl 
toward submissiveness and conformity, and 
directing the boy toward autonomy. 

Those factors which were considered and 
found to be irrelevant for achievement of the 
tasks also tell a part of the story. In short, 
they seem to show that it is the covert feel- 
ings and motivations of the individual that 
bear much more weight than some of his 
overt, observable behavior; however, in some 
areas he finds it difficult to hide his inner 
feelings. 

Another conclusion comes from the con- 
tinual reoccurrence of the same findings 
rather;than from their high statistical signifi- 
cance., Doggedly reappearing is the fact that 
children whose parents exercised severe con- 
trol in their formative years were hindered 
on accomplishment of the tasks. Tied in- 
evitably with this type of home background 
was an inner feeling of guilt about impulsivity 
which, in turn, impeded mature emotional 
reactivity. Such severe parental control stifles 
the growth of emotional, personal security 
and results in the child’s sensing himself as 
worthless as an individual; when he feels 
ego-impelled this inner feeling brings with 
it guilt feelings which he must repress. Thus, 
one framework of adjustment—a healthy, 
positive, self-directing, confident self-concept 
—is stifled in its development. It is important 
to note, however, that this pattern appeared 
only in the analyses of the girls; this is prob- 
ably because homes in this culture have more 
severely controlled girls. This vicious circle 
has often been described in the literature 
in recent years; the present finding is but 
another confirmation of it. Summarily, those 
16-year-olds who are well socialized appear 
to be self-directed but not self-centered. 


Qualitative Analysis 


It is very important to accumulate infor- 
mation about the manifest components of 
behavior in relation to given factors, but this 
must be supplemented by additional insight 
to understand the particular dynamics oper- 
ating in the individual. Therefore, against 
the backdrop of significant factors, clusters 
and patterns of factors, case studies were made 
of six adolescents who uniquely achieved on 
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the five tasks, with the hope that such descrip- 
tions would vitalize the statistical analyses 
and translate some of these findings from 
abstractions to meaningful reality in terms of 
living adolescents. 

The rankings of the three boys and three 
girls studied because of their extremely high, 
low, or inconsistent rankings on the develop- 
mental tasks correspond on their rankings on 
the variables identified by the statistical analy- 
ses as significant for each task. They rank “as 
they should” on most of the factors, but some 
few instances do point up clearly the fact 
that occasionally a child achieves or does 
not achieve because one factor may greatly 
overshadow the others, either from environ- 
mental conditions or because of his own 
outlook—or, more often, from a combination 
of both. For both the detailed case studies 
and the specific rankings on each factor, the 
reader is referred to the full account of this 
study (6). Suffice it to say here that the 
six case studies show that the previous life 
experiences of these six adolescents related 
to the presence or absence of traits significant 
for achievement of the tasks in a meaningful 
pattern, and further that these experiences 
had tended to nurture or stifle development 
of these characteristics. ‘This, in turn, in- 
fluenced the individual’s concept of himself 
and his manner of approaching each new 
developmental task, and subsequently his 
accomplishment of the task. 


Highly Socialized Adolescents 


The highly socialized boy and girl have 
more nearly common patterns of factors 
and developmental life experiences than the 
others; but even here one sees sex differences 
operating, both in the individual and in the 
environment. 


Herbert Bond? has much less need to conform 
than does Ann Harper. Both were fortunate enough 
to grow up in homes where they were accepted 
and where they were made to feel they counted as 
individuals. The family relationships were tightly 
knit and both were able to see themselves as having 
secure places in the family. The fact that this was 
even more so for Herb reappears in his having even 
more desirable relations with age-mates than Ann 
does. Interaction of family members is harmo- 
nious, with the result that these children learned to 
maintain harmony in other social relations. They 
have been taught the meaning and value of work 


T Fictitious names are used for all Ss. 


and have internalized the need for working. Con- 
trols were exercised when necessary, but the par- 
ents attempted to help the child understand the 
rationality of them. The parents themselves were 
well adjusted and not unduly striving to maintain 
or change their status in society. Thus, since they 
were acceptant of their own mode of life, the 
identifying child likewise accepted his. On this 
foundation the youngsters began school, and be- 
cause they were well-socialized, the school eagerly 
accepted them and worked to further their social- 
ization. Each new successful accomplishment made 
the next one easier for Ann and Herb. 


For these well-socialized adolescents it 
seemed to be important that both had flexi- 
bility, maturity, and skill in interpersonal 
relations without too great emotional involve- 
ment; stable home backgrounds; and kind, 
interested, and accepting parents. Ann and 
Herb reflect comparable constellations of 
factors and achieve comparable healthy pat- 
terns of socialization. 


Poorly Socialized Adolescents 


The poorly socialized individuals have the 
same consistently low ratings at all ages and 
show clearly how one attacks each new de- 
velopmental task in the light of his previous 
experiences. 


Both Martha Gates and Ned Foster are narcissis- 
tic and starved for affection. They exert all their 
energies toward seeking the love of which they 
have been deprived, but they do so in very differ- 
ent ways, each the result of the unique combina- 
tion of biological, personality, and social factors. 
Their lot in early life was not good; neither had a 
sound home base from which to operate nor 
exemplary parents with whom to identify. 

Martha is an example of a child deprived in the 
physical, cultural, and intellectual areas. She has 
responded psychologically by passive submission 
and strives to make herself invisible in order to 
avoid any condemnation. Her low intellectual fa- 
cility, coupled with her lack of any keen powers of 
observation and insight, has caused her to be non- 
plussed and overwhelmed by her environment. This 
behavior may 2lso be more appropriate to the outer 
world which has been more disinterested and ignor- 
ing of her than actively rejecting. Mrs. Gates in 
particular has shown a desire not to be bothered 
by Martha, and a lack of interest in her rather 
than any active antagonism. In her school also she 
has been more ignored than scapegoated. Conse- 
quently in the home and in the school, there was 
actually little for Martha to fight back at except 
a nebulous rejection. Since she is not gifted with 
capacities for coping with her environment in a 
retaliative manner, she has simply succumbed—but 
has not been happy to do this. She is in a per- 
petual state of inner conflict, and thus uses energy 
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that might otherwise be put to more constructive 
ends to cope with it. Her rejection, poor family 
relations, and concept of herself as being worth- 
less have made her feel it futile to try to succeed 
as her peers do. She has adopted the fatalistic 
attitude that it is the world that causes her to lose; 
and since the whole world is against her, it is too 
much to try to remedy her situation. This attitude, 
coupled with her very limited abilities, has killed 
any zest for working on the common tasks for one 
her age. Martha has adopted an attitude of resigna- 
tion and despair. 

Ned Foster, possibly because of his greater in- 
tellectual ability and keener insight, takes steps to be 
retaliative to his poor lot in life. Moreover, Ned’s 
mother actively and openly rejects him, so he feels a 
greater need to retaliate and also has greater ca- 
pacities and imagination to devise means, both 
conscious and unconscious, to do so. Much of his 
energy is directed compulsively to expressing hos- 
tility toward his parents, particularly his mother. 
Because he gives vent to his hostile feelings in such 
negative ways, he continually handicaps himself—he 
does not win the affection he craves either from 
her or his age-mates. His failure to win affection 
makes him even more antagonistic and the vicious 
circle widens. Thus it has been for Ned through- 
out his school history, and his peer relations have 
become constantly worse. This, in turn, has made 
him more bitter and negative in his approach to 
all developmental tasks. In fact, he expends his 
energy aggressively fighting the world by means 
which are self-defeating; then he paranoically with- 
draws to his only real friends—animals and an older 
woman to whom he is attached—or he seeks solace 
in reading. Because of his superior ability, he 
achieves comparatively well academically in spite 
of these disturbing forces. 

Different constellations of factors account for Mar- 
tha’s and Ned’s poor achievement on the develop- 
mental tasks. However, their failures have made 
them more discouraged and more ready to strike 
out at the world, but each does it in the way his 
uniqueness prompts. 


Inconsistently Socialized Adolescents 


The inconsistent achievers are examples 
of the same levels of achievement on the 
same tasks, but the constellations of respon- 
sible factors, and even more strikingly their 
developmental histories, have been diver- 
gently opposed. 

Unfortunately, not all adolescents have 
healthy, desirable patterns to copy. More- 
over, parents sometimes.attempt to live out 
their conflicts through their children. Both 
conditions hinder the socialization process. 


Donna Swift is a striking example of one who 
found both problems major in her environment. 
Mrs. Swift was both very poorly adjusted and 
strongly desirous of solving her own problems 


through Donna, her eldest daughter. This seems 
to be a frequent picture of American family dy- 
namics, and it is usually the eldest son or daughter 
who must bear the brunt of the parent’s neurotic 
needs. Donna, cowed by her mother’s aggressive 
demanding that she conform to a rigid adult code, 
was not able to reject the patterns forced on her. 
It was also all-important for her to conform to 
the adult authority patterns of her school. Her 
identification with her mother causes her in turn to 
preach the adult code and antagonize her peer 
group, and makes her adhere to a somewhat dis- 
torted code. This further alienates her peers. She 
reacts by showing a proud isolation, and in order 
to remain acceptable in her own eyes, she “goes 
all out” for what she succeeds at, and thus mini- 
mizes the importance of her other failures to herself. 
This is especially important in her case, because 
from the beginning she struggled to achieve in her 
peer society. Unfortunately, her mother was not 
capable of understanding what was happening to 
Donna, nor did her teachers. She has the ability 
to be insightful into her dilemma, but it is likely 
she will not be, for her rigidity was adopted to 
protect herself against her inferiority feelings and 
her guilt over her strong unconscious antagonism to 
authority. Only deep therapy could hope to free 
her. 

Lloyd Abbott, while presenting the same com- 
pensatory achievement on the same tasks, is prob- 
ably using compensation as a desirable defense 
mechanism. When physiological maturation makes 
defense no longer necessary, he may readily, easily, 
and insightfully discard it. His family has generally 
provided a stable, secure base from which Lloyd 
has been able to establish life for himself, and he 
has moved in a rather wide area. The lad’s fairly 
clear-cut middle-class principles seem possibly to 
have come from the church as well as from the 
home. Lloyd accepts and lives by the conventional 
code of morality, apparently much more rationally 
than blindly. While part of his motivation may be 
to get along with as much comfort and as little 
stress or censure as possible, it also appears that he 
does it because it mzkes sense to him, and because 
ht likes people and intends to obserye their rights. 
Lloyd behaves morally largely because he believes 
in it—partly, of course, to insure his creature com- 
forts through social expediency, but not nearly so 
much as he feigns. He gets genuine satisfaction 
from his friendly interaction with others, and from 
an unconscious sense of doing the morally right 
thing. The general congruence of Lloyd’s seif- 
concept and his behavioral self is very high—he is 
what he thinks himself to be. In academic work 
he succeeds because of his superior intelligence, his 
efficiency, and because when there is pressure on 
him, he mobilizes his resources with a minimum 
of effort. 

The one area in which Lloyd’s development has 
lagged is in achieving an appropriate sex role, both 
overt and covert. This seems to stem from a general 
lack of drive and his passivity, in addition to his 
vague identifications with male figures. Possibly 
in this area added years will bring maturity, but 
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Lloyd does not have much push to grow up and 
assume the responsibilities of the heterosexual rela- 
tions that come with maturity. His moral life 
does not involve conflict, and he goes along enjoying 
his play and procrastinates with a casual, “I don’t 
date; I guess I’m just not interested.” This is per- 
haps an accurate appraisal by Lloyd of his adjust- 
ment—he is not interested and he does not know 
why, nor does he question why. From Lloyd's 
casual approach to all life, one would think that his 
inconsistent achievement on the tasks might be more 
coincidental than the result of any marked attempt 
at compensation. 

Donna and Lloyd portray the same patterns and 
equal over-all levels of achievement on the tasks, 
but this achievement has a very different meaning 
for each and a very different genesis and develop- 
mental history in each, Parental pressures were 
great in Donna’s case and she adopted their prin- 
ciples to give her a moral code at variance with 
her innner feelings. No such pressures were on 
Lloyd; he adopted his code because it made sense 
to him, and because he likes people and intends 
to observe their rights; his principles are reality- 
oriented and well integrated and there is a spon- 
taneous inner feeling behind his social action. Both 
achieve in academic areas for more nearly similar 
reasons; both do it to please adults. But Donna 
needs this as a compensation for her low peer 
status, whereas Lloyd is careful to be acceptable to 
his peers. Donna's overt sexual behavior likewise 
exemplifies her compensatory mechanism whereas 
Lloyd’s is probably largely retarded development. 
Donna is neurotically compensating on tasks to 
remain acceptable to herself when failure to de- 
velop in all areas in early adolescence, traceable 
to the parental pressures, alienated her peers. Lloyd 
is compensating constructively and positively. 


In conclusion, several other questions are 
raised by the inconsistent achievers: Is it 
significant, or merely fortuitous, that the 
equivalent rankings for the inconsistent 
achievers are on the same developmental 
tasks? Is this the most common compensa- 
tory pattern for youth in American culture? 
If so, why? Does the fact that the conscience, 
morals, values task and the developing ot in- 
tellectual skills task, both perhaps intellectu- 
ally based, again cluster here while the sex role 
tasks, both overt and covert, are at the lower 
extremes in rating have significance for the 
answer? Although no final conclusions can 
be drawn from these data, it is hypothesized 
that this is a pattern which is influenced by 
the biological and personality forces in the 
individual and promoted by our societal codes 
and institutions. 


SUMMARY 


The concept of developmental tasks en- 
compasses the general problem of child- 
rearing in a culture. The aim of research 
pertinent to this concept will be to discover 
more of the variables that make for doing 
this effectively, and then to devise means 
for facilitating the socialization process. This 
study suggests some variables germane to 
adolescent socialization in a Midwestern 
American community. It does this by statis- 
tically treating rich interdisciplinary case data 
on 15 sixteen-year-old boys and 15 sixteen- 
year-old girls to isolate personality factors 
and social roles that are significant for the 
sex groups for the accomplishment of five de- 
velopmental tasks. In addition, six uniquely 
achieving cases are intensively studied to 
show the dynamics of a concrete situation 
and its importance. Each of these case 
studies aligns well with both the normative 
data and the dynamic approach. The find- 
ings stress the greater importance of emo- 
tionality and its expression in outlets satisfy- 
ing to the individual and acceptable to the 
society. If the adolescent is to accomplish 
successfully the developmental tasks required 
in his society, it is imperative that he master 
his impulsivity and accept himself; so that 
he can mobilize his energy to deal effectively 
with the social and cultural forces which 
impinge upon him. 
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GROUP PLAY THERAPY WITH MENTALLY RETARDED 
' CHILDREN * 


BENJAMIN MEHLMAN 
Kent State University 


HE purpose of the current study is to 
| investigate psychological changes, as 
indicated by performance on the Re- 
vised Stanford Binet, Form L, the Grace 
Arthur Point Scale of Performance Tests, 
Revised Form II, the Rorschach, the Hag- 
gerty-Olson-Wickman Behavior Rating Scale, 
and the California Test of Personality, evoked 
in institutionalized endogenous mentally re- 
tarded children as the result of a nondirective 
group play therapy experience. 

Specifically, it is hypothesized that (a) posi- 
tive therapeutic results, as determined by the 
measures used in this study (the Rorschach, 
the California, and the Haggerty-Olson- 
Wickman Behavior Rating Scale) will occur 
as the result of nondirective group play 
therapy with institutionalized endogenous 
mentally retarded children; (4) that increases 
in intelligence, as determined by the measures 
used in this study (the Revised Stanford 
Binet, and the Grace Arthur Point Scale of 
Performance Tests, Revised Form II) will 
occur as the result of nondirective group play 
therapy with institutionalized endogenous 
mentally retarded children; and (c) that 
there is a relationship between changes in 
intelligence (as measured by the Binet and 
the Grace Arthur) and changes in person- 
ality (as measured by the California, the Ror- 
schach, and the Haggerty, et al. Behavior 
Rating Scale). 


Review of the Literature 


Relatively few studies have been concerned 
with therapy and the mentally defective child. 
Generally, qualified opinion is that mental 
deficiency is largely of an immutable charac- 
ter and beyond limited psychotherapeutic aid. 
White (36), Maslow and Mittelman (26), 
Alexander and French (2), and Lurie (25), 
to mention a few, have expressed or implied 


1 Submitted in partial fulfillment of the requirements 
for the degree of Doctor of Philosophy in Clinical 
Psychology in the Graduate School of Syracuse Uni- 
versity, January, 1951. 
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“ 


this view. Benda says that “... the en- 
dogenous type is a developmental disorder 
starting in prenatal life and is, therefore, 
somewhat removed from an efficient thera- 
peutic influence after birth” (7, p. 183). 
Rogers states that for therapy the client must 
possess “. . . adequate intelligence for cop- 
ing with his life situation, with an intelli- 
gence rating of dull normal or above” (28, 
p. 74). Snyder, in commenting on this, notes 
that “actually, most of the more successful 
published cases have been those of persons of 
better than average intelligence . . . most 
therapists agree that at least dull normal intel- 
ligence is advisable” (30, p. 474). Axline (4) 
notes that little work has been done with the 
mentally handicapped child and recommends 
the active participation of the parents in work 
with this type of child.” 

The general statement that can be made 
from a survey of the published literature is 
that there have been few well-designed and 
effectively carried out experimental studies. 
The ambiguity of these results has done little 
to reconcile the diverging views concerning 
whether psychotherapy with the mentally re- 
tarded child is successful or unsuccessful. 
The studies that have been done have been 
primarily concerned with the refutation or 
demonstration of change in the areas of per- 
sonal adjustment and intellectual perform- 
ance. There is some suggestion in the liter- 
ture of favorable personality adjustment as 
the result of therapy, but this is ambiguous. 
The evidence with regard to changes in meas- 
ured intelligence due to therapy is also open 


‘to question. Nor is there much that is well 


defined in the literature concerning the rela- 
tionship of personality and intelligence under 
therapeutic conditions, this relationship more 
often being assumed than demonstrated. 


2 Published reports of cases may be found in references 
1, 5, 6, 8, 9, 10, 11, and 12, Experimental reports 
may be found in references 2, 13, 14, 16, 17, 19), 20, 
23, 37, and 38. 
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METHOD 
Selection of Children 


All of the children in the present study were from 
the Syracuse State School for the Mentally Retarded. 
The author had access to the records of all of the 
children in the institution. He searched these, 
selecting children on the basis cf three criteria: 
(a) they were white, (4) they were between the 
ages of 5 and 12, and (c) they had been diagnosed 
by the institutional staff as “familial” in type. 
These cases were then studied individually and sev- 
eral cases were eliminated from the study when the 
author felt that there was more than ample room 
for disagreement about the diagnosis. To check on 
the accuracy of this double diagnosis (by the insti- 
tutional staff and the author) an abstract was made 
of the case folder material available for each child, 
and these abstracts were submitted to three judges 
(one neurologist and two psychologists) for con- 
firmation regarding the diagnoses of endogencity. 
Using agreement between two of the three judges, 
as the criterion, question could be raised about only 
one of these children. 

For the 32 children as a whole, chronological ages 
varied from 86 to 140 months, with a mean age of 
120.8 months; length of residence in the institution 
varied from 4 months to 48 months, with a mean 
length of residence of 18.0 months; and the order 
of birth ranged from first to ninth. The environ- 
ment was rural for 5 of the 32 children, and urban 
for the remaining 27. As the criterion for con- 
sidering a member of the family mentally deficient, 
the author used residence at any time in an insti- 
tution for mental defectives or being termed men- 
tally deficient in the social worker's case history 
write-"» on the basis of personal interview or test 
informauon. Only one child had no mentally 
defective relatives in his immediate family. 


Design 


The design of the study included three groups— 
Group Play Therapy, Group Inactive, and Group 
Movie. Group Inactive had no contact with the 
experiment other than to have pre- and posttesting 
experiences. Group Movie, it was felt, was neces- 
sary to control for two factors: the influence of the 
therapist’s personality as such, and disturbance of 
the institutional routine. It is not kere implied that 
therapy is carried on divorced from the personality 
of the therapist. The trend is to recognize the 
importance of the therapist’s personality and in fact 
to capitalize on it. It is implied, however, that 
while there is a core of behavior common to all of 
the activity of the therapist, yet there are important 
differences too. Group Movie was also considered 
necessary on the basis that one might justifiably 
have speculated that any positive changes that might 
have occurred had taken place merely because the 
children in the experimental group had the institu- 
tional routine relieved, and not because of the thera- 
peutic influence in the experimental group. To 
control for this “common core” of behavior and 
attitude in the therapist, and also to provide as 


much vicarious experience as was possible, Group 
Movie was set up. Several short movies were shown 
or stories read during 14 one-hour sessions. A 
descriptive behavioral account of what actually 
occurred was written by the author shortly after 
each session. Although it had originally been hoped 
that observers might be secured for this group, none 
were, because of the lack of available time on the 
part of potential observers. 


Testing 


All pretesting was completed in the six-week 
period before play therapy began, and all post- 
testing was concluded in the six-week period fol- 
lowing the end of play therapy. Two examiners 
were used, both of whom had been recommended 
as competent for the specific tests they administered. 
The MHaggerty-Olson-Wickman Behavior Rating 
Scale was completed for each child by the academic 
teachers of the institution. Qualitative comments 
regarding the test behavior of each child were 
solicited from the examiners. The examiners were 
aware of the general nature of the study, but they 
did not know either at the beginning or at the end 
of testing which children were in which group. 

So far as pretest results are concerned, Binet IQ’s 
ranged from 52 to 78, with a mean of 64.9. Per- 
formance Quotients on the Grace Arthur, Revised 
Form II, ranged from 46 to 90, with a mean of 67.1. 
Raw Scores on Schedule A of the Haggerty-Olson- 
Wickman Behavior Rating Scale ranged from 12 
to 146, with a mean of 53.1. The higher score here 
indicates greater personality difficulty. On the Cali- 
fornia Test of Personality raw scores ranged from 
10 to 40, with a mean of 26.6. 


Grouping 


Three groups of individually matched children 
were formed by the following procedure: the sum- 
mary face sheets of the Rorschach of each child 
were given to eight competent judges, with instruc- 
tions to rate each child in terms of his adjustment 
on a five-point rating scale. To establish reliability 
the eight judges were randomly assigned to two 
groups, and a Pearson correlation was computed 
between their combined ratings. For the first 
administration the correlation between the two 
groups of judges was .79. For the second adminis- 
tration, the correlation was found to be .77. Cor- 
relations of this magnitude would occur through 
the accidents of sampling in less than one sample 
in a hundred. When correlations between the rat- 
ings of pairs of judges were combined, using the 
z transformation (27), it was found that the com- 
bined 2’s of the obtained magnitude, for both the 
pre- and posttestings, would occur through the 
accidents of sampling in less than one sample in 
a hundred. 

The ratings for each child by all of the judges 
were then summated, and a rank order of adjust- 
ment in terms of these summated ratings was made. 
Every third child was then placed in the same 
group. When ties in rank did occur, they were 
resolved by consistently placing the older child first. 
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By this procedure the children were equated in 
terms of judges’ ratings of their total initial 
adjustment. 

To determine whether the groups were equated 
in terms of their lengths of residence, chronological 
ages, and test performances, analysis of variance 
technique was used (29). Save for the Z and P 
number categories on the Rorschach the anull 
hypothesis could not be rejected. Use of the 
analysis of covariance technique in handling the 
results reported below statistically equated the 
groups for pretest standings. 
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scale and to record a descriptive prose account of 
what had actually occurred during the sessions. 
The orientation of this running account was be- 
havioral rather than interpretative or evaluative. 
The therapist completed parallel reports, and all 
reports were completed as soon after each session 
as was practicable. 


Resutts * 


The first hypothesis states that there are no 
statistically significant changes in person- 


TABLE 1 
Prescores AND Apyustep Gains For THE Various Measures oF Persona.ity * 








Pray THERAPY 





PRETEST 


MEAN GaIn 


ADJUSTED 


MoviE INACTIVE 








ADJUSTED 
Gain 


PRETEST 
MEAN 


ADJUSTED 
Gain 


PRETEST 
MEAN 





California ** .09 .56 
Haggerty *** .18 62 
Rorschach Sum of Judges’ Ratings .70 .23 
Rorschach Response Number -70 -9I 
Rorschach .95 -51 
Rorschach 70 .30 
Rorschach F% -71 .03 
Rorschach .40 84 
Rorschach .65 .60 
Rorschach P Number 3.40 -75 
Rorschach Number W . 80 -03 
Rorschach Number D -70 —.64 
Rorschach Number Dd .10 1.89 
Rorschach T/7:R 20.80 —5.50 
Rorschach T/1R Black 22.40 —7.12 
Rorschach T/:R Color 19.00 —3.77 
Rorschach 7/7R Black Color 5.00 —.Oo1 


—4.15 25.82 
-48 50.91 
-06 15.36 

24.20 3.31 23.91 

4-55 41 5-59 

65.20 .O1 61.09 

67.30 31 65.18 

45-30 55 47-55 

5.85 -39 13-73 
2.40 -70 1.55 
2.00 -79 3.36 

17.50 3.55 17.09 

4-70 .08 3.36 

18.00 .56 17.09 

20.50 .68 18.64 

14.70 . 36 15.18 

9.00 -95 6.09 


26.90 
42.89 
14.90 





* For the computations of adjusted gains see Snedecor (29, p. 322). 
** The items on the California were summed to make one composite score. 


of better adjustment. 


A lower score is indicative 


*** Only Schedule A was used. The lower score is indicative of better adjustment. 


There were thus three fairly well matched groups. 
Since it was felt that 11 children would be too many 
to work with efficiently within a single play therapy 
session, the 11 children in Group Play Therapy were 
divided into two equivalent groups using the same 
procedure of summated judges’ ratings of their 
Rorschach face sheets. 


Therapeutic Process 


Nondirective group play therapy, along the lines 
described by Axline (4, 5, 6) and others (28, 18, 
24) was carried out with Group Play Therapy by 
the author. The play therapy groups were seen 
twice a week for 50 minutes each session for a 
total of 29 sessions over a 16-week period. An 
attempt was made to have an observer present in 
the playroom during each play session. This did 
not prove administratively feasible (because of the 
unavailability of observers). As it worked out, one 
observer was present for fourteen sessions, one for 
three sessions and one for just one session. The 
function of the observers was to complete a rating 


ality, as determined by the measures used 
in this study, as a result of a nondirective 


8 A fairly extensive account of the actual happenings 
during the play therapy and movie group sessions is 
contained in the dissertation, including a discussion of 
the limits of the therapy situation, relevant qualitative 
factors, and a summary of the behavior of each child 
during the play therapy sessions. Included too is a 
description of the procedures used to establish the relia- 
bility of the summaries and the observations. Briefly, 
the summaries appear reliable, and while complete 
agreement was not obtained in regard to every dimen- 
sion rated, there appeared to be considerable agreement 
between the therapist and observer observations as indi- 
cated by the rating scale used. Pertinent here are the 
findings that although there were important differences 
in the play behavior of the two piay therapy sub- 
groups, there was an essentially nondirective role played 
by the therapist in each of the subgroups. The only 
statistically significant change on the objective tests used 
when the two play therapy subgroups are compared is 
the greater change toward better adjustment evidenced 
by one of the groups on Schedule A of the Haggerty- 
Olson-Wickman Behavior Rating Scale. 
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group play therapy experience with institu- 
tionalized endogenous mentally retarded 
children. Table 1 indicates the relevant 
quantitative data. Analysis of covariance 
technique was used to equate statistically the 
groups on the initial testing. The mean pre- 
test scores and the mean adjusted gains (27) 
are reported. The F ratio for the HOW 
Behavior Rating Scale, Schedule A, is signifi- 
cant at well below the .o1 level of confidence, 
and the F ratio for the F°% of the Rorschach 
is significant at slightly below the .05 level of 
confidence, when the three experimental 
groups are compared simultaneously. These 
are the only statistically significant F’s ob- 
tained. In terms of the results on the Be- 


TABLE 2 


PERFORMANCE ON THE MEASURES OF INTELLIGENCE 








PLAY 
ITEM THERAPY Movige INACTIVE 


Group Play Therapy and Group Inactive, 
and Group Play Therapy and Group Movie 
were not significant. The adjusted #’s for 
the comparisons were 3.259, 1.712, and 1.284. 
The second hypothesis states that there are 
no statistically significant changes in intelli- 
gence, as determined by the measures used 
in this study (the 1937 Rinet, Form L, and 
the Grace Arthur, Revised Form II) as the 
result of nondirective group play therapy ex- 
perience with institutionalized endogenous 
mentally retarded children. Table 2 indicates 
the mean pretest and the mean adjusted gains 
for each of the experimental groups. Neither 
of the variance ratios was significant. We 
may conclude that there were no statistically 
significant changes in intelligence as deter- 
mined by the measures used in this study. 


TABLE 3 


INTERCORRELATIONS BETWEEN CHANGES ON D1FFERENT 
MEASURES FOR THE THIRTY-TWO CHILDREN 





Binet 
Pretest 
mean 65.09 
Adjusted 
gain — .63 
Grace Arthur 
Pretest 
mean 71.75 
Adjusted 
gain +40 





havior Rating Scale by Haggerty-Olson- 
Wickman and the F% on the Rorschach at 
the respective levels of confidence cited, we 
must reject the null hypothesis that there 
was no difference in the changes effected in 
personality. 

In terms of the Haggerty, and using an 
adjusted £(27), the differences between 
Group Play Therapy and Group Inactive 
were significant at below the .05 level of con- 
fidence, and the difference between Group 
Play Therapy and Group Movie was not 
statistically significant. The difference be- 
tween Group Movie and Group Inactive was 
not statistically significant. The adjusted ¢’s 
for the three comparisons were 4.951, 1.638, 
and 2.397 respectively. 

For pi PF, of the Rorschach, the adjusted 
zt for the difference between Group Inactive 
and Group Movie was significant at the .05 
level of confidence. The differences between 





Grace CAati- Hac- Ror- 
Test BineT ARTHUR FORNIA GERTY sSCHACH ® 





Binet -124 -121 .092 -002 
Grace Arthur -017 .084 -175 
Cal:fornia —.146 -325 
Rorschach * .0g8 





* Summiated judges’ ratings. 


The third hypothesis states that there is 
no relationship between changes in intelli- 
gence (as measured by the Binet and the 
Grace Arthur) and changes in personality 
(as measured by the California Test of 
Personality, Schedule A, of the Haggerty- 
Olson-Wickman Behavior Rating Scale, and 
summated judges’ ratings of Rorschachs). 
Here, because the relationship between gains 
or losses on the different measures is in focus, 
regardless of the causal agents assumed to be 
operating, the data for the 32 childrea have 
been grouped together. Table 3 indicates the 
intercorrelations for all of the children for 
each pair of measures. None of these rela- 
tionships is statistically significant. We may 
conclude that for the 32 children of the 
present study there was no evidence of any 
statistically significant relationship between 
changes in intelligence and changes in per- 
sonality, using the already indicated meas- 
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ures. (Parenthetically we might note that 
there was little agreement between changes 
on the different measures of personality or 
between changes on the different measures of 
intelligence.) 


Discussion 


The Therapeutic Process 


The aim of nondirective therapy is to pro- 
vide a situation where the client will be free 
from threat, free to examine and differentiate 
aspects of himself, his environment, and his 
relationship to his environment, and free to 
achieve new or different syntheses or integra- 
tions of meanings, and thus, finally, a more 
adequate adjustment. This adjustment is 
considered to be largely a function of the sort 
of self-perceptions the individual has and of 
the manner in which these are organized. 

The aim of therapy is the evolution of the 
“adequate self” (31). There are three aspects 
especially of nondirective therapy and phi- 
losophy that make essential contributions to 
this goal. First, there is ossumed to exist a 


drive toward psychological equilibrium or 
health, a drive that is not free to operate 
under great threat. Strong threats function 


to freeze or rigidify the individual’s percep- 
tions and meanings, and hence his behavior, 
and result in inadequate adjustment. Therapy 
acts to free the organism from threat. This 
drive toward the “maintenance and enhance- 
ment of self” (31) is assumed to exist in the 
mentally retarded as well as in his more for- 
tunately endowed neighbors. 

Second, in nondirective therapy the thera- 
peutic process is viewed as primarily emo- 
tional rather than intellectual in nature. In 
nondirective practice there is no direct 
attempt made to give the child insight, nor 
does the child necessarily concern himself 
directly with his problem. Fleming and 
Snyder (18) observed that the children they 
treated made few, if any, references to their 
personal problems, or to members of their 
family, and the like. In the current study, 
while there were some references by the chil- 
dren to their farnilies and their feelings 
toward the institution, the surprising thing 
is that there were so few such references. 
The assumption here is that the mentally re- 
tarded child, despite his intellectual limita- 
tions, can grow in therapy because therapeutic 


progress does not depend on_ intellectual 
ability. 

Third, in nondirective therapy the relation- 
ship between therapist and client is viewed 
as crucial. The personality of the therapist 
is not denied or negated, but rather is uti- 
lized. The therapeutic relationship is marked 
by warmth, stability, and permissiveness. 
The client is accepted as he is and the struc- 
ture of the relationship is such as to facilitate 
acceptance of the client by himself. Tech- 
niques are subordinate to the relationship. 
In this relationship the focus of the therapist 
is the feeling being expressed, and one of the 
functions of the techniques used is to induce 
or create an atmosphere where there can be 
free expression of such feeling. 

In the current experiment the therapeutic 
relationship varied greatly from child to 
child, not only in terms of the frequency of 
contact between the therapist and the child 
in the same sessior., but also in terms of the 
quality and development of the relationship. 
Some children seemed to wish to avoid any 
sort of contact for long periods of time. With 
some children, although the contacts were 
limited in number, they were laden with vital 
meaning for the children. With others the 
meaning of individual contacts could be 
ascertained only over a span of time, for they 
showed little involvement with any single 
interchange. The differences in relationships 
are suggestive of the fact that different chil- 
dren could use the therapist and the thera- 
peutic situation in unique fashions. 

Not only do children appear to make 
unique use of the therapeutic situation, but 
also it appears that different children start 
therapy from different points. Theoretically at 
least, it would seem to follow, then, that dif- 
ferent children may have different optimal 
points of termination of the group play 
therapy sessions. It may well be that the 
therapeutic experience was continued long 
enough for some of the children to show 
demonstrable results but not for others. 

It is worth noting that there were several 
factors that appeared to militate against an 
ideal nondirective situation; ¢.g., too many 
children in the play therapy groups for the 
therapist to reach all in the manner desired, 
incompatibility of some of the children, the 
emotional turbulence created in some chil- 
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dren by the violent expression of aggression 
of others, institutional staff complaints to the 
children about the noise emanating from the 
play room, and other probably less important 
factors. 

Although there were important qualitative 
differences between the two play therapy 
groups, of the five objective tests there were 
statistically significant differences only on the 
Haggerty. Here the group that was more 
aggressive, less cohesive as a group, and more 
involved in contact with the therapist made 
the greater gains in adjustment. There was 
a lack of associated change on the Rorschach 
and the California. 


The Nature of the Criteria 


No brief is made for the essential validity 
of the tests used in this study as the criteria 
for the evaluation of therapeutic gain, nor 
even necessarily for the reliability of the tests 
used. The Rorschach, for example, requires 
a good deal more research of a controlled and 
statistically sophisticated nature than has been 
forthcoming to date (15). The basic stand- 
ardization data for the Grace Arthur Revised 
Form II have not yet been published (3). 

In our own determination of agreement 
between changes in the different measures, 
grouping the 32 children together (since we 
were simply interested in changes, however 
caused) we found that there was statistically 
insignificant association between the Binet 
and the Grace Arthur. There was no statisti- 
cally significant agreement among the dif- 
ferent measures of personality either. Because 
of the small numbers, the homogeneity of the 
children and the relatively small range within 
which changes occurred, the correlations may 
be spuriously low. Even granting the influ- 
ences of the above factors, it would seem that 
there was little agreement between changes 
in the measures of personality, or between 
changes in the measures of personality and 
intelligence. 

Over and above the question of agreement 
among different measures, we know of no 
universally trustworthy criteria for evaluating 
therapy. Even psychological factors such as 
insight may not only be the result rather than 
the cause of therapy, but may conceivably also 
lead to poorer test results. (Increased insight 
may be synonymous with, or may lead to 


greater awareness of one’s personal problems, 
for example, and thus seemingly indicate 
negative therapeutic results on a personality 
questionnaire when the therapeutic results 
may actually be positive in the sense that the 
client has achieved some insight.) Even if 
there are no changes on test data, we are not 
justified entirely in concluding that there 
have been no therapeutic changes. It might 
well be that the instruments used are too 
crude to pick up what may be subtle but 
important changes that may have occurred. 

Involved too is the question of who should 
make the evaluations. If there is disagree- 
ment between the client’s evaluation of suc- 
cess and/or the therapist’s evaluation of 
success, and/or the objective test data, which 
is right? Should the evaluations be based on 
symptom disappearance or on changes in the 
configurational aspects of what is presumed 
to be the deeper personality level ? 

Even if test data did show a perfect cor- 
respondence with our subjective observations, 
there is far from universal agreement that 
increased aggression or decreased aggression, 
or increased sociability or decreased socia- 
bility, for example, is categorically a sign of 
therapeutic gain or loss. There is no univer- 
sal agreement that the withdrawn child is 
better adjusted because she later enters into 
more social activity or that the formerly con- 
stricted child is better because she later ex- 
presses aggression freely. 

Involved in this problem of therapeutic 
evaluation, too, is the question of when does 
one apply the criteria to determine thera- 
peutic change—immediately after therapy has 
been concluded (is it terminated by the 
therapist at some arbitrary point, or by the 
client, also possibly at some arbitrary point), 
six months later, or two years later? 

For practical purposes at the present time, 
the best that can be done is to recognize the 
limitations of the criteria used and to proceed 
with what seems reasonable, being certain to 
state explicitly the criteria used. 


The Quantitative Results 


There were significant changes in person- 
ality, as evidenced by our measures, only on 
the Haggerty and the F°{ of the Rorschach. 
Group Play Therapy moved toward increased 
adjustment, in terms of the Haggerty, signifi- 
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cantly more than did Group Inactive, but the 
differences between Group Movie and Group 
Inactive and Group Play Therapy and Group 
Movie were not statistically significant, 
although the differences in this last compari- 
son were in favor of Group Play Therapy. 
Thus, of the three major measures of person- 
ality used, improved adjustment is reflected 
on only one. 

Unfortunately, there are several factors 
which enter to contaminate the Haggerty 
data, some of which cancel others but to 
unknown degrees. One of these factors is the 
fact that the children of Group Play Therapy 
became identified with the study for the 
teachers who did the ratings, because the play 
sessions were held during school hours, and 
the children had to leave and return to the 
classroom. It is true that the teachers knew 
all of the children who participated in the 
study (for they made ratings before therapy 
was begun) but it is likely that the children 
in Group Movie (where the sessions were 
held after school hours) and Group Inactive 
were not as easily identified with the study. 
Any biases operating in the teachers with 
regard to the experiment might reasonably be 
expected to appear, although to an unknown 
degree, in the ratings they made. But even 
if the bias operating was to “help” the experi- 
menter, the enormous changes for some of the 
children make it difficult not to accept at least 
the direction and some of the changes as real. 
In this particular study (and this contributes 
to our failure to dismiss the Haggerty gains 
for Group Play Therapy as simply contami- 
nated) there also is evidence to indicate that 
the attitude of the teachers and of certain 
staff members was not entirely favorable. 
The evidence is in the form of requests from 
the teachers via the children that there be less 
noise, remarks from attendants and other 
staff members marvelling at the activity and 
the amount of noise issuing from the play- 
room, and in the attitudes that may be pre- 
sumed to be operating when a teacher’s class 
is interrupted periodically by the leaving and 
returning of the pupils. Over and above this 
is the threat probably inherent to the institu- 
tional personnel in considering a different 


4It is important to emphasize here that if there 
was any disruption of the play therapy atmosphere, it 
was due to a lack of intimate knowledge of the study 
rather than to any lack of spirit of cooperation. 


approach to children. Thorne (35), for 
example, speaks of the negative attitudes on 
the part of the personnel of an institution 
when a less discipline-oriented program was 
being introduced. The logical possibility 
also exists, however, that if in the beginning 
these children constituted problems of a sort 
to the teachers, the thought that they were 
being given treatment of one form or another 
may have resulted in closer scrutiny of the 
children involved with an expectation of 
change, and/or possible changes in the atti- 
tudes of the teachers toward the children, 
and/or objective recognition of real changes 
due to therapy. In view of the fact that con- 
firming evidence is not available in the form 
of objective test data where the teachers and 
the problems inherent in an evaluation of 
their ratings were not involved, we must hold 
the gains shown by Group Play Therapy sug- 
gestive but not conclusive of change. 

Using analysis of covariance technique the 
F°% of the Rorschach was the only other 
measure used that showed statistically sig- 
nificant changes between our experimental 
groups. 

Using a ¢ for the comparison of two groups 
at a time, the differences between Group 
Inactive and Group Movie were found to be 
significant at between the .or and .02 levels 
of confidence. The differences between 
Group Play Therapy and Group Inactive, 
and Group Play Therapy and Group Movie 
were not statistically significant. The vari- 
ances were approximately equal for all com- 
parisons. These differences between the 
three experimental groups would seem to 
indicate that the trend in institutional en- 
dogenous mentally retarded children, in this 
institution at least, is toward an increase in 
F°% and that the procedure carried on in 
Group Movie had a diminishing effect on the 
F°%, whereas the procedure in Group Play 
Therapy had no comparable effect. ‘The 
analysis of covariance ratio was significant at 
just slightly below the .05 level of confidence. 
Since there was no confirming evidence on 
either the Rorschach or the other measures 
of increased “intellectual constriction” or 
“anxiety” associated with the increase in F°/, 
it is difficult to explain these changes on 2n 
other than chance basis. 





60 BENJAMIN MEHLMAN 


SUMMARY AND CONCLUSIONS 


Thirty-two institutionalized, endogenous, 
mentally retarded children, divided into three 
individually matched groups, were used in an 
investigation into the personal and intellec- 
tual changes, and the interrelationships be- 
tween such changes, evoked as the result of 
an experience in nondirective, group play 
therapy. Play therapy was carried on with 
one of the groups, while the other two groups 
served as control groups. Statistically signifi- 
cant increases in adjustment were found for 
the play therapy group only on the Hag- 
gerty-Olson-Wickman Behavior Rating Scaie. 
There was no statistically significant change 
for any of the groups on either the Binet or 
the Grace Arthur. In grouping the 32 chil- 
dren together to determine the relationships 
between changes on the different measures 
no statistically significant relationships were 
found. 
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CHANGES IN INTERPERSONAL PERCEPTIONS FOLLOWING 
SOCIAL INTERACTION * 


JAMES BIERI 
The Ohio State University 


interaction of two individuals. It 

attempts to measure the effects of this 
interaction in terms of the changes produced 
in the way each participant perceives the 
other. The general problem posed is: How 
does the way we perceive another person 
change as a result of interacting with that 
person? No attempt is made to relate the 
perceptual changes to variables such as the 
particular kinds of group processes or roles 
which the individuals take. However, the 
present study does set out to demonstrate that 
measurable changes do occur and that these 
changes have directionality in terms of the 
individual’s perceptions of himself. 

The specific hypothesis to be tested in this 
study has evolved from two systematic corol- 
laries of the Psychology of Personal Con- 
structs? The first of these states that a 


Tie research is concerned with the social 


person’s perceptual system varies as he suc- 
cessively construes or perceives events. The 
other corollary involved here is that to the 
extent that one person understands another, 
he may play a role in a social process with 


that other person. Assuming that under- 
standing another individual involves predict- 
ing his behavior, a person’s predictions of 
another’s behavior may be said to reveal how 
he perceives that other individual. If it is 
assumed that our perceptions of others are 
constantly changing as a result of social inter- 
action, then, over a period of time, a series of 
our predictions of the behavior of others 
should reflect the changes in our perceptions 
of them. 

During the course of a period of free re- 
search, an attempt was made to determine 
the possible dimensions upon which these 

1 This articl= is based on a thesis presented in partial 
fulfillment of the requirements for the degree of Master 
of Arts at The Ohio State University. The writer is 
indebted to Dr. George A. Kelly for his assistance and 
guidance in all phases of this research. 


2Ketry, G. A. The psychology of personal con- 
structs. Mimeographed manuscript. The Ohio State 


Univer., 1951. 


perceptual changes occur. It was noted 
that when group members, after intervals 
of interaction, were asked to predict each 
other’s responses to the situations depicied 
on the Rosenzweig Picture-Frustration Study, 
the predictions of responses shifted toward 
greater agreement with their own responses. 
It may be expected that many variables are 
playing a part in producing the changes in 
perception. These variables might include 
the types of relationships in the roles the 
participants play in the interaction situation, 
the nature of the task given the participants, 
and the participants’ prior acquaintanceship 
with each other. 

In the present experiment, a_ relatively 
simple type of social interaction was selected 
to produce the changes in interpersonal per- 
ceptions. Two individuals of the same sex, 
who were relative strangers to each other, 
were put into an interaction situation char- 
acterized by considerable freedom of ex- 
pression. The experimenter structured the 
subjects’ interaction so that they discussed 
and came to conclusions on common ex- 
periences and preferred activities. This type 
of interaction could be described as a con- 
structive role-relationship experience in which 
attainment of mutual agreement and accept- 
ance is emphasized. As the interaction pro- 
ceeds, a progressive change in each of the 
participants’ perceptions of the other may be 
expected. Specifically, it is hypothesized that 
the changes will be in the direction of per- 
ceiving the other individual as more similar 
to oneself, as a result of increasing agreement 
with and knowledge about the other person. 
This change in perception following inter- 
action can be measured by comparing suc- 
cessive predictions of another’s behavior. 

The following experimental hypothesis is 
investigated: An individual will perceive 
another person as more similar to himself 
after a period of constructive interaction than 
before this interaction has occurred. 
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METHop 


Subjects 


Students in the beginning undergraduate course 
in psychology at the Ohio State University were 
the subjects (Ss) for the experiment. Inasmuch as 
this course is required for all undergraduates in the 
university, the Ss represented a variety of major 
areas. All students in the course are required to 
serve as Ss for psychological experiments. The Ss 
went through the experimental procedure in pairs. 
The members of each pair were of the same sex 
and an attempt was made to obtain each member 
of a pair from different sections of the course. 
A few of the pairs were students in the sare section 
with only a nodding acquaintanceship, and the 
majority were total strangers to one another. Half 
of the pairs were assigned to an experimental group 
and half to a control group. Table 1 indicates the 
number, age, sex, and marital status of experimental 
and control groups. 


TABLE 1 


Ace, Sex, anp Marirat Status or ExperimMENTAL 
AnD ConTrot Groups 








Tora N MEAN 


Group N Mate Femarte CA 





Experi- 
mental 26 16 10 
Control 26 16 10 


19.91 
19.97 





Predicting Instrument 


The criteria for selecting the predicting instru- 
ment were: (a) that it be of a fair degree of interest 
to the S in terms of social stimulus value and (4) 
that it be relatively short and easy to administer. 
For this purpose, a modified version of the Rosen- 
zweig Picture-Frustration Study (P-F Study) was 
selected. The usual pictures for the P-F Study 
were used. However, a multiple-choice response 
system was devised. Three possible responses for 
each picture were given: one of the three being 
intropunitive, one impunitive, and one extrapuni- 
tive. No attempt was made to control the number 
of ego-defensive, obstacle-dominant, or need-per- 
sistive responses on the multiple-choice alternatives. 
The scoring manual for this test (3) lists the most 
frequent types of responses to each picture and for 
each of the three categories. These responses were 
utilized in the present multiple-choice system. In 
cases where no responses of a certain type for a 
given picture were listed in the manual, the write: 
made up a response of that type for the situation 
depicted. The responses were presented to Ss in 
rotated order so that one type of response did not 
always appear in the same position among the three 
alternatives. The rationale behind the type of 
response sclected, the frequency with which certain 
responses occurred in a given protocol, and shifts 


from one type of response to another played no part 
in the present study. 


Interaction Situation 


The primary requirements for the interaction 
situation were: (a) it should be a constructive role 
relationship, i.c., both members contribute to the 
discussion of a common task on which some agree- 
ment is to be reached and in which some degree of 
mutual acceptance is obtained, (4) it should provide 
for a maximum of interaction in a relatively short 
period of time, and (c) the problems to be dis. 
cussed should be of such a nature as to be common- 
place or ordinary in terms of everyday experiences. 

To this end, two topics for discussion were 
selected. The first of the two situations had to do 
with a discussion of several aspects of the psychol- 
ogy course both students were taking. Such topics 
were selected, in part, because they centered about 
a common experience of both members and as such 
could serve as an “ice-breaker.” 

The second topic of the interaction situation had 
to do with an hypothetical vacation which the two 
Ss were to spend together. Although this deviated 
from the ordinary inasmuch as it involved imagin- 
ing they had‘ plenty of money, it did involve the 
interests and desires of each participant in an under- 
taking which was conceivably within the life possi- 
bilities of both of them. 


Experimental Group Procedure 


The two Ss were brought into a room and seated 
at different desks in such a manner as not to be 
directly visible to each other without turning. Each 
S entered identifying information on a slip of paper. 
He was then given the modified P-F Study de- 
scribed above, indicating his answers on an answer 
sheet. Instructions to Ss were: 

You notice there are 24 situations depicted in this 
booklet. On this other sheet, there are three responses 
indicated for each of the 24 situations. You are to 
select from the three responses presented for each 
picture the one response which you feel you would 
most likely give in the situation depicted. On the 
answer sheet provided, write the letter a, 6, or ¢ of 
the response you select opposite the number of the 
picture. Remember, for each picture select the response 
you feel you would most likely give in the situation 
depicted. 


At times, an S would ask, “Do you mean the 
response I should give or would give?” In such 
cases, the directions were repeated with emphasis 
upon the words would most likely give. Queries 
were few, and most Ss grasped the directions 
immediately. 

Following this initial self-response to the P-F 
Study, the experimenter (E) asked Ss if they knew 
each other. This was usually followed by negative 
replies from both members. Then E asked each S 
in which section of the psychology course he was. 
The responses to these few questions formed the 
basis of the early interaction of the two individuals. 
Occasionally, spontaneous remarks were made by 
one or both partners, but in all cases verbal com- 
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munication between the partners was minimal at 
this stage of the experiment. 

Subsequent to the two queries of E, the Ss were 
told that although they didn’t know each other 
very well, each was now to fill out the P-F Study 
as he felt the other individual filled his out. Re- 
sponses to this request were varied. Most Ss hesi- 
tated slightly and then proceeded to work. Others 
laughed slightly or smiled as they began. A few 
protested mildly or stated that such a task was 
difficult because they didn’t know each other. In 
such cases, E repeated the instructions, adding that 
they should try as best they could. In all cases, this 
additional statement was sufficient to start the most 
skeptical Ss to predict how the other person filled 
his out. 

Following this first phase of the experiment (the 
self-response and first prediction), Ss were seated 
at the same table, face to face, and the first of the 
two interaction situations was presented. The 


Almost all Ss followed these directions without 
question, but a few asked for more information as 
to what they were to do. The E would then explain 
that they were to fill out the test as they felt the 
other individual had originally filled his out. 

The following are the three phases of the experi- 
mental group procedure: (a) self-response and first 
prediction on P-F Study, (4) two 10-minute inter- 
action situations, and (c) second prediction on P-F 
Study. 


Control Group Procedure 


Phases (a) and (c) were the same for both con- 
trol and experimental groups. However, instead of 
the 20-minute interaction period, Ss in the control 
group remained seated at separate desks with their 
backs to each other. They were given the same 
stimulus material concerning the psychology course 
except the directions were given in the singular. 


TABLE 2 


ComMPaRISON OF MEAN “AGREEMENT WITH SELF” Scores FoR EXPERIMENTAL AND ConTROL Groups 








AGREEMENT WITH SELF. 


STAND. 





First PREDICTION 


SEconD PREDICTION 


Error 
MEAN 





Group N MEAN SD 


MEAN SD 


Dirr. 


Diff. 





Experimental 26 4-22 
(Interaction) 


Control 
(No Interaction) 26 


13.50 


13.88 4-34 


15.81 3-13 2.31 


14.57 


0.62 


4-64 0.69 0.40 





* p=.001. ' 
S° g==.50. 


students were asked to discuss their psychology 
course along four main lines: (@) what they liked 
about the course, (4) what they disliked about the 
course, (c) how they felt the course could be 
improved, and (d) what they thought the goals of 
the course should be. This discussion lasted 10 
minutes. The E was present unobtrusively to one 
side and did not participate. 

The second interaction situation was then struc- 
tured for the pair of Ss. They were told: 


Imagine this is the end of the quarter and you both 
have two weeks to spend together on a_ vacation. 
Imagine also that between the two of you, you have 
ample financial resources. You are to discuss where 
you would like to go, how you would get there, and 
what types of things you both would enjoy doing 
together on the vacation. 


This interaction also lasted ro minutes, following 
which the participants were stopped and the third 
phase of the experiment was instituted. In this 
last phase, Ss were again seated at separate tables 
and given the same P-F Study materials as before. 
Then the following directions were given: 


Now you've had a chance probably to get to know 
each other a little better than you did before. Now 
I would like you to fill this out again as you feel 
the other person filled his out. 


In this way, S was concerned only about himself 
and not also about the other person. The directions 
for the vacation were: 


Imagine this is the end of the quarter and you 
have two weeks to spend on a vacation. Imagine also 
that you have ample financial resources. You are to 
decide where you would like to go, how you would 
get there, and what types of things you would enjoy 
doing on the vacation. 


The Ss were directed to write on both topics. 
The time allotted for these tasks was the same as 
for the experimental group. 


REsuLTs 


Prediction Results 


Table 2 presents the results for both experi- 
mental and control groups. The two columns 
headed “Agreement with Self” refer to the 
number of responses on an individual's first 
and second predictions respectively which 
were the same as that individual's self-re- 
sponses. There is a total possible agreement 


of 24 responses. It is important to realize 
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that the results presented are in terms of each 
individual’s predictions compared with his 
own self-responses on the test. 

The experimental hypothesis states that 
there will be greater agreement with self- 
responses on the second than on the first 
prediction. Utilizing the difference scores 
between the second and first predictions, #’s 
were calculated for both experimental and 
control groups according to the formula for 
correlated means. The value obtained for the 
experimental group, t=3.73, with 25 degrees 
of freedom, is significant at the .oor level of 
significance, and the hypothesis that the mean 
population difference is zero must be rejected. 

The value of ¢ obtained with the control 
group, t==1.72, with 25 degrees of freedom, 
is significant above the .10 level of signifi- 
cance, indicating that the rejection of the null 
hypothesis with the control group is not 
reasonable. 

A test was made to determine if there was 
a significant difference between the mean 
difference values of the experimental and 
control groups. Employing the usual ¢ test 
for uncorrelated means, a value of t—=2.20, 
with 50 degrees of freedom, was obtained. 
This value is significant at the .o4 level and 
indicates that the hypothesis that no differ- 
ence exists between the mean difference scores 
of the experimental and control populations 
is scarcely tenable. 

In this respect, it will be noted that both 
groups had a mean difference in the hypothe- 
sized direction, even though that of the con- 
trol group was not significant. Two possible 
explanations for this are: (a) some interaction 
did occur between the control group partners 
during phase two, perhaps of a nonverbal 
nature, or possibly, (6) the positive difference 
for the control group represents the unrelia- 
bility of the test materials used. 

The distribution of Ss in the control and 
experimental groups who did and did not 
fall in the hypothesized direction was ana- 
lyzed using chi square. The usual procedure 
of dividing the chi-square table so that the 
marginal totals are approximately equal was 
employed. The cut-off score thus selected 
was a difference score of +2. The value of 
chi square obtained, 7.69, with one degree of 
freedom, is statistically significant at less than 
the .or level. 


Interaction Analysis Results 


Because specific interaction variables were 
not directly involved in the present study, 
detailed observation and recording of the 
interaction situations were not made. How- 
ever, with E as observer, eight categories 
adapted from the Bales interaction scale.(1) 
were used in recording the group interaction. 
These categories were: “shows solidarity,” 
“shows tension release,” “agrees or passively 
accepts,” “gives information, suggestion, or 
direction,” “asks for information, suggestion, 
or direction,” “disagrees,” “shows tension,” 
and “shows antagonism.” Each time an S 
interacted verbally, his verbalization was tal- 
lied in one of the eight categories. Prob- 
ably because of the nature of the interaction 
situation, only three of the eight variables 
occurred with any great frequency. These 
three variables, in addition to the total num- 
ber of verbal interactions of each S, are pre- 
sented in Table 3. The term “negative cases” 
in Table 3 refers to those Ss whose percep- 
tions of their partner showed no change or a 
change in the direction of greater dissimi- 
larity with their self responses. Likewise, 
“positive cases” refers to those Ss who showed 
a change in the hypothesized direction, i.e., 
toward greater similarity with their self- 
responses. From Table 3, it is apparent that 
little difference exists between the positive 
and negative cases in any of the four cate- 
gories presented. There is evidence that, in 
the interaction situation, a constructive role 
relationship was obtained in which mutual 
agreement and acceptance is emphasized. 
The mean number of “agreement and accept- 
ance” responses of the experimental group 
was 10.38, while the mean number of “dis- 
agree” responses was only 0.88. 

Perhaps most significant among the nega- 
tive cases in the experimental group is the 
fact that three of the six negative cases were 
Negroes. Furthermore, they were the only 
Negroes in the experimental group. In all 
three cases, the Negro S was interacting with 
a white S. It may well be that the specific 
role relationship played out in the Negro- 
white interaction was of such a rigid, cul- 
turally defined nature that the Negro S did 
not feel free to construe or perceive his white 
partner as more similar to himself. That 
such a rigid role relationship did exist is 
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indicated by the findings (Table 3) that in 
two of the three cases (Ss g and 11) the 
Negro S had much less talk interaction than 
the white partner (Ss 10 and 12 respectively). 
Qualitative notes on the interactions involv- 
ing these two Negro Ss state that in each case 


are realized. Although it has social and 
pictorial stimulus value, the P-F Study re- 
stricts itself to those social situations involv- 
ing some degree of frustration. The fact that 
many social relationships involve little or no 
frustration will limit generalizations made 


TABLE 3 


Versat Interaction Data For ExpeRIMENTAL Group 








Susyacr Tora. 


VERBALIZATIONS 


n 
m 
“ 


Asks FOR 
INFORMATION, 
SUGGESTION, 
DtrecTIon 


Gives 
INFORMATION, 
SUGGESTION, 
DrirEcTION 


AGREES, 
PAssiIvVELY 
Accepts 





50 
54 
48 
51 
52 
71 
56 


f 
f 
f 
f 
m 
m 
m 
m 
f 
f 
f 
f 
m 
m 
f 
f 
m 
m 
m 
m 
m 
m 
m 
m 
m 
m 


Total (N=26) 
Mean 
SD 

Positive Cases (N=20) 
Mean 
SD 

Negative Cases (N=6) 
Mean 
SD 


I 
7 
7 

27 

13 

17 

13 
2 
6 

10 

17 

10 

14 

20 

12 
8 

18 

12 

12 
3 
5 

10 
2 


18 3 
39 
30 
22 
25 
39 
32 
28 
20 
25 
36 
43 
46 
31 
17 
28 
44 
27 
21 
27 
47 
41 
37 
40 
38 
25 


CACHAN OW OW SaNNNW HK VSO DAS OH AGH 


31.77 
8.82 


31.95 
15.42 


31.16 
13.92 





* Negative cases. 
+ Negro subjects. 


the Negro played the submissive partner in 
a social process dominated by the white S. 
Such findings are only suggestive as to what 
factors may have been operating in the actual 
role relationships. 


Discussion 


Methodologically speaking, the limitations 
of the P-F Study as a predictive instrument 


from this study. It may be that other types 
of predictive measures, such as the rating 
scale method used by Dymond (2), may have 
more applicability when a more global, less 
socially restrictive type of prediction is 
desired. 

The findings suggest that the more pheno- 
typical interaction variables used in this study 
to record group interaction may not ade- 
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quately differentiate group process in terms 
of role relationships. A more genotypical 
variable structure, preferably stemming from 
a systematic position, would probably provide 
more meaningful explanations of what oc- 
curred in the interaction situation. 

The results of this research lend themselves 
to various interpretations and extensions. It 
could be stated, for example, that the in- 
creased similarity with self-behavior in the 
prediction of another’s behavior after inter- 
action is an “identification” phenomenon. 
The two persons, interacting in a rather 
friendly manner, identify with each other in 
terms of perceiving the other as similar to 
oneself. Or, the results could be interpreted 
as indicating that “projection” has taken 
place, inasmuch as Ss have attributed to their 
partners their own behavioral responses. 
Lacking specific evidence concerning the 
interpersonal processes or affective relation- 
ships of the partners, it seems reasonable to 
construe the results along the lines indicated 
by the experimental hypothesis. Only by 
more precise measurement and greater under- 
standing of the variables within the type of 
interaction structure of this research can 
more theoretical embellishments be added to 
the results. 


SUMMARY 


The following hypothesis was investigated: 
In a constructive interaction situation, one’s 
perception of another will change in the 
direction of increased similarity to oneself. 
Interaction between two strangers of the same 
sex was employed. The experimental and 
control groups each consisted of 13 pairs of 
Ss. After taking a modified multiple choice 
Rosenzweig Picture-Frustration Study, each 
S predicted his partner’s responses on this 
test. This was followed by group interaction 


for the experimental group and individual 
written tasks for the control group. Then a 
second prediction of the other’s responses on 
the Picture-Frustration Study was filled out 
by each S. 

The results show a significant increase in 
similarity to self between the first and second 
predictions for the experimental group. The 
control group showed a slight, but statisti- 
cally insignificant increase in similarity to 
self-responses. In comparing the two groups, 
the experimental group showed a signifi- 
cantly greater increase in mean similarity 
than the control group. 

Data collected on the group interaction 
variables show no discrimination between 
positive and negative experimental Ss. Three 
of the six negative experimental cases were 
Negroes who int racted with white Ss. 

1. The study demonstrates the possibility 
of measuring interpersonal perceptions in 
groups as influenced by the type of inter- 
action situation. 

2. The findings of the study are construed 
as supporting the hypothesis that in a con- 
structive group interaction situation in which 
mutual agreement on experiences and pre- 
ferred activities is emphasized, members 
come to perceive their partners as more simi- 
lar to themselves. 

3. It is possible that more rigidly defined 
cultural roles, such as Negro-white inter- 
action, may inhibit perceiving others as simi- 
lar to oneself. 
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A FURTHER STUDY OF THE EFFECTS OF THERAPIES ON 
MMPI RESPONSES * 


WILLIAM SCHOFIELD 


University of Minnesota 


His study is essentially a replication of 

an investigation reported earlier in 
which an analysis was made of the 
changes in responses to the Minnesota Multi- 
phasic Personality Inventory (MMPI) of psy- 
chiatric patients undergoing different types 
of therapy (3). Limitations of available data 
made it necessary to limit the original investi- 
gation to female patients exclusively. It was 
considered desirable to check the findings of 
the initial study by making parallel analyses 
of the MMPI records of male patients as 
appropriate data became available. This 


paper reports the results of such analyses. 
The purpose of both studies has been to 
explore the manner and degree to which the 
MMPI measures the effects of various thera- 
peutic regimens and, hence, the extent to 


which it might prove a useful instrument for 
evaluating patient progress and therapeutic 
efficacy. 

It appears reasonable that a test which was 
designed for diagnostic purposes and which 
includes measures of extent of deviation 
along certain major axes of psychiatric in- 
volvement should reflect, in those very meas- 
ures, changes in the personality of the 
individual as he becomes more sick or as he 
improves with the passage of time or the 
effects of therapy. It is possible, however, 
that attention to changes in scale scores alone 
might not detect more subtie indices of 
change which could be uncovered only by 
study of item responses. For this reason 
these studies have been concerned not only 
with changes in the profile of scores on the 
clinical scales of the MMPI, but have included 
extensive analysis of “item behavior” of pa- 
tients when tested over certain intervals and 
of the relationships of item-response changes 
to score changes. 


1 This study was supported in part by a research grant 
from the Graduate School of the University of Minnesota. 
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SuBjEctTs 


The subjects (Ss) used in this study, all males, 
were selected so as to be as directly comparable as 
possible to the Ss of the previous research except 
for sex. Data will be reported for four groups: 
University of Wisconsin student controls, University 
of Wisconsin student outpatients, psychoneurotic 
patients who were hospitalized but received no 
electroshock therapy, and psychotic patients who 
received electroconvulsive therapy (ECT).? Certain 
gross descriptive data for these male Ss are pre- 
sented in Tables 1 and 2. In the case of the patient 
groups, as was true for the female study, all hos- 
pitalized Ss were considered improved at time of 
discharge. No uniform progress data were avail- 
able on the hospital clinic outpatients. 

Table 1 reveals that the Wisconsin male controls 
and clinic patients are somewhat younger than the 
female normals and outpatients of the previous 
study. Also, the intervals between the two MMPI 
administrations in these male groups are roughly 
twice the length of those which obtained for the 
comparable females. The male hospitalized neu- 
rotics of this study differ from the hospitalized 
female neurotics of the original analysis in being an 
average of ten years younger, in being hospitalized 
for a much shorter period, and in having a briefer 
interval between the two Multiphasics. The male 
psychotics of this study treated with ECT are some- 
what younger than the comparable female sample, 
have a slightly longer interval between MMPI’s, and 
a very similar mean length of hospitalization. The 
average number of ECT treatments administered to 
the patients of this study was approximately ten, 
which is an average of about three more than were 
received by the female sample. With respect to the 
variables of age, length of hospitalization, and 
interval between the pre- and posttherapy MMPI 
profiles, it is clear that the male Ss of this study 
show divergence from the central tendencies of the 
previously reported female samples, but these 
divergences are not great. 

Table 2 reports the diagnostic composition of the 
three male clinical groups. The outpatient group 
shows a marked concentration of anxiety states and 
obsessive-compulsive syndromes and is therefore 


2 The writer is indebted to Mr. Peter Kaufmann for 
generously providing the test data of the two University 
of Wisconsin samples (2). The hospitalized neurotics 
and psychotics were treated by the Division of Psychiatry 
of the University of Minnesota Hospitals. 
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TABLE | 


Gross Descriptive Data ror THE Mae Susyects * 








Days tn Hosp. 


Days BETWEEN MMPI's o/ 
4° 











Grourp**® N 
MEAN 


RANGE 


RANGE MEAN RANGE SINGLE 





UWC 27 
UWP 24 
PN 10 
PS-S 22 


17-33 
19-29 
17-55 
18-68 


27-5 
49-5 


32-236 

53-395 
4-59 30 
5-84 50 


144.3 
182.4 
22.0 
34.6 


70.0 
36.4 


9-63 
17-115 





* For comparable table, see Table 2 in original monograph (3). 


** UWC—University of Wisconsin students, nonpatients; UWP—University of Wisconsin 


student outpatients; 


PN—Nonhospitalized psychoneurotics treated in the University of Minnesota Outpatient Clinic; PS-S—Psychotics 
hospitalized at University of Minnesota Hospitals and treated with electroconvulsive therapy. 


+ High school education or better. 


very different from the corresponding female 
sample. The hospitalized neurotics show good 
diagnostic similarity to the corresponding females. 
The male psychotics who received ECT are likewise 
very similar in diagnostic composition to the com- 
parable female sample. 

In considering the findings as to profile and item 
changes associated with the particular therapies of 
the male groups of this study and in comparing 
these changes with those that obtained in the pre- 
vious investigation of females, the above similarities 
and differences of major aspects of the comparable 
samples must be kept in view. 


REsuLts 
Profile Changes 


Figures 1, 2, 3, and 4 show the mean 
pre- and posttest profiles of the four groups. 
Those mean differences having statistical re- 
liability are indicated directly on the profiles.* 

From Figure 1 it may be noted that seven 
of the nine scales show a mean tendency 
toward lower scores in the control students 
after an average interval of five months. 
The changes are slight and for the most part 
not reliable. In the case of the Hy and Ma 
scales, however, the controls show a reliable, 
though very small, decrement in scores. On 
the Hy scale, the mean decrease in score 
amounts to less than two raw score points 
(%=1.82), but the test-retest consistency of 
scores on this scale in the control group was 
apparently so high that the associated stand- 


8A table reporting the mean pre- and posttherapy 
scale scores, standard errors of the differences and the 
accompanying tests of significance, and other tables 
reporting the statistical findings of extensive analyses 
which are not reported in detail in this paper may be 
obtained from the American Documentation Institute, 
1719 N Street N.W., Washington 6, D. C. Order 
document No. 3659 remitting $1.00 for 35 mm. micro- 
film or $1.00 for 6 by 8 in. photocopies. 


ard error of the mean difference is quite 
small (ir, ==.582), and the resulting critical 
ratio indicates the change, though slight, to 
be stable. Likewise, the mean change in raw 
scores on the Ma scale was 1.33 points; the 
standard error of this difference was 622. In 
general, the test and retest profiles of the male 
controls of this study are very similar to those 
of the female normals in the earlier study. 

Figure 2 presents the mean pre- and post- 
therapy profiles of the Wisconsin student- 
patient group. Again, a very consistent trend 
to lower scores on retest is observed with the 
mean changes in this treated group of neu- 
rotics being of greater magnitude than was 
found in the normals. Changes on the Pd, 
Mf, Pa, and Ma scales are not statistically 
reliable. The mean test-retest score decre- 
ments among the remaining clinical scales 


TABLE 2 


COMPOSITION OF THE CLINICAL Groups BY DIAGNosIs 








= 


Group D1acnosis 





UWP (N=24) Anxiety tension state 
Compulsive-obsessive 

Hysteria 

Depression 
Schizophrenia—incipient 
Adolescent reaction 
Psychoneurosis, mixed 
Psychoneurosis, reactive depression 
Psychoneurosis, psychaesthenia 
Schizophrenia, paranoid 
Schizophrenia, mixed 
Manic-depressive, depressed 
Involutional melancholia 
Paranoid condition 


PN (N=10) 


PS-S (N=22) 


NUANN NUW we ew Ne 





*For comparable table, see Table 3 in original 
monograph (3). 
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are all highly reliable. It may be noted that 
the mean scores show a distinct change in the 
direction and degree of slope of the F-K 
differential. This is very similar to changes 


in these scales found to be associated with 
successful therapy of the female Ss. The 
changes in the mean K scores and F scores 
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are statistically reliable. It appears that by 
virtue of diagnostic considerations, nature 
and extent of therapy, and length of therapy, 
the male, student-outpatient group shows 
more extensive personality alterations than 
are reflected in the posttherapy MMPI pro- 
files of the female psychiatric outpatients, 


Pa Pt Sc Mo 


Fic. 2. MEAN Pre- anp PosrrHerapy MMPI Prorixes oF 24 Psycniatric OuTPpATIENTs Wo RECEIVED AN 
AVERAGE OF 14 PsYCHOTHERAPEUTIC INTERVIEWS 
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who showed no reliable changes on any of 
the scales after an average of five visits to a 
large outpatient psychiatric clinic. 

The effect of an average of one month of 
hospitalization in a small, well staffed and 
equipped psychiatric unit on the MMPI pro- 
files of male neurotics is shown in Fig. 3. 
The consistent tendency to lower retest scores, 
even in the absence of markedly extreme 
values initially, is seen again. The changes 
in mean score are very small and, with the 
exception of that on the Hs scale, not reliable. 


yo Hs 
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D Hy Pd Mf 


In general, the group changes in the MMPI 
pivfiles of these four samples of males show 
good agreement with the changes shown by 
comparable female groups of the earlier 
study. The particular behavior of the profiles 
of specific therapy groups may not be attrib- 
uted simply to selective sex factors. Stability 
of group profiles in the normals of both sexes 
appears to be very good, with reliability of 
certain scales approaching a magnitude which 
generates statistical confidence in very small 
changes in mean scale score. 


Po Pt Sec Mao 


significant at) 5%]! 


Mean Pre- anp PostrHerapy MMPI Prorires or 10 PsycHoneurotic Patients WHo WERE 


HosP!raALizep FoR AN AVERAGE OF OnE MoNTH 


The mean change in raw score on Hs was 1.7 
points; the standard error was 650. In com- 
paring these data with those obtained from 
hospitalized female neurotics, it is interesting 
that they also produced their only reliable 
mean change on the Hs scale. 

Figure 4 demonstrates the effects of an 
average of ten ECT treatments over a mean 
interval of 35 days on the MMPI profiles of 
male psychotics. Again, the marked changes 
are very similar to those found with female 
patients treated with ECT. The changes in 
mean clinical scores of the profiles of Fig. 4 
are significant at the .or level except for the 
Ma scale. In the previous study of females, 
all clinical scales except Mf and Ma showed 
highly reliable changes following ECT. 


The finding of marked changes in the 
group standings of the male, student out- 
patients, with reliable improvement indicated 
on five of the nine clinical scales following 
therapy, requires comment in view of the 
failure of the female outpatients of the pre- 
vious study to show significant changes on 
any of the variables. The male Ss were 
younger by an average of nearly ten years 
and, by virtue of their selection, were un- 
doubtedly brighter than the female clinic 
patients. Furthermore, the interval between 
MMPI’s and the extent of therapy contacts 
was generally greater for the males. All four 
of these variables, age, intellect, interval, and 
intensity of treatment, favor the males and 
would support greater anticipation of favor- 
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able changes in the male records than in the 
female. Finally, the greater “accessibility” 
of the student counseling center and the 
more effective coordination of efforts possible 
within the campus environment as contrasted 
with the clinic, which must treat people far 
from their home base, is to the comparative 
detriment of the efforts of the hospital clinic. 
In consideraticn of these variables, we would 
be led to be relatively optimistic as to how 
the MMPI might reflect “response to therapy” 
in the student center, and relatively pessimis- 


2. & Be Hs 


110 
100 


D Hy Pd Mf 


provement, the MMPI gives good evidence 
of the therapeutic change. In situations in 
which we have some reason to doubt thera- 
peutic efficacy and there is no clinical judg- 
ment of uniform patient improvement, the 
MMPI remains unchanged. These findings 
support the hypothesis that the MMPI may 
be used as a measure of response to therapy. 


Item Response Changes 


In order to analyze possible changes in the 
patterns of MMPI responses which might not 
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significant 


Fic. 4. MEAN Pre- anp PostrHerapy MMPI Prortres or 22 Psycuortic Patients WHo Rece!veD AN AVERAGE 
OF 10 ELECTROCONVULSIVE TREATMENTS 


tic as regards the outpatient clinic. The data 
fit these considerations. Furthermore, recog- 
nizing these sources of group-to-group vari- 
ance, we may not conclude that male patients 
improve while females do not, or that tke 
type and extent of psychotherapy available 
through a hospital outpatient clinic is basi- 
cally ineffective. In this regard, it must be 
further noted that all members of the male 
outpatient sample had been clinically ap- 
praised to be “improved,” but no such 
appraisal of progress was available for the 
female outpatients. 

In summary, we find that in a situation in 
which we would anticipate positive response 
to therapy and improved psychiatric status 
and where there is clinical judgment of im- 


be revealed through gross study of changes 
in scale scores, item studies were made of 
the first and second responses of each S and 
records were obtained of the frequency of 
changed responses per item, per individual, 
and per group. Also, the frequency of 
change of response in certain groupings of 
items was determined. The relationship be- 
tween number of item changes (ic) and 
amount of score change (sc) was deter- 
mined for each S and the central tendencies 
and variabilities of this ratio (2ic/Xsc) were 
computed for each sample. These statistics 
permitted direct comparison with comparable 
measures obtained from the female Ss of the 
earlier report. 

Table 3 reports the means and standard 
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deviations for the per cent of item responses 
changed, from test to retest, in each of the 
present samples. Figures for the comparable 
female groups are also reported. It may be 
noted that there is close similarity between 
the males and females with respect to the 
magnitude of response variability in com- 
parable groups and, further, that each sex 
shows the same order of frequency of re- 
sponse changes among the various subject- 


TABLE 3 


MEANS AND STANDARD DeviaTions ror Per Cent or 
Responses CHANGED t 





Mean °%, CHANGED 
IN COMPARABLE 
FemMace Group 


Group 





UWC 
UWP 
PN 


14.0 
22.0 
18.4 
30.6 











t VALuEs 





PN 





UWC 
UWP 
PN 


1.54 
1.51 


-374 





t For comparable table, see Table 6 in original 
monograph (3). 
© 9=2.05. 


treatment groupings. In both sexes, the 
normals show lowest frequency of changed 
responses and the psychotics treated with 
ECT show the highest rate of change. In 
both sexes, the outpatients and the hospital- 
ized neurotics show a percentage change 
which is closely similar, which is approxi- 
mately equally greater than the rate of change 
in the normals of the respective sexes, and 
smaller than the per cent of change in the 
psychotic-shocked Ss. 

There are no reliable differences among the 
four samples of this study with respect to 
mean number of changed responscs. Varia- 
bility with respect to number of responses 


ScHOFIELD 


changed is reliably smaller in the controls 
than in any of the three clinical groups.* 

Analysis of the differences between com- 
parable male and female groups yielded some 
reliable sex variation with respect to frequency 
of response change. For both the normal and 
outpatient groups, the male samples showed 
a higher mean number of response changes, 
the differences significant beyond the .o1 
level in both instances. In comparing the 
hospitalized neurotic and shocked-psychotic 
samples of the two sexes respectively, signifi- 
cant differences in variability were found. In 
both comparisons, the males showed reliably 
greater variability in number of responses 
changed. In these latter comparisons, as in 
those for the normal and outpatient samples 
of the two sexes, the males yielded markedly 
higher mean frequencies of response change, 
but the differences in variability of the func- 
tion did not permit tests for reliability of the 
mean differences. 


Item Change vs. Score Change 


Table 4 reports the means and standard 
deviations of each of the groups for the ratio 
of number of items changed to amount of 
score change. Only 307 of the 550 items of 
the MMPI are scored on the nine clinical 
scales. Since certain of these items are scored 
on more than one scale, the total possible 
amount of scale score generated by these 307 
items is 476. Thus, 56 per cent of items 
(307/550) were related to score units, and 
each item contributes an average of 1.55 
(476/307) points. Assuming that changes in 
responses frora one MMPI record to another 
of a given individual were randomly dis- 
tributed among scored and unscored items, 
the above data generate the chance expectancy 
that 86 per cent (476/550) of response 
changes would be reflected in score changes 
(increases and decreases).° Stating this 
another way, with a completely random dis- 

* Basic statistics for these analyses are available in 
tables from the American Documentation Institute. See 
footnote 3. 

5In the original study, there was failure to consider 
the multiple-scale appearance of some of the scored 
items. This led to an erroneous conclusion that the 
normal Ss showed a chance distribution of their response- 
changes among scored and unscored items. This error, 
corrected in the present study, in no way invalidates the 
interpretation of significant differences among the various 


samples of the previous study with respect to relative 
concentration of changes among scored items. 
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tribution of response changes among scored 
and unscored items, one would expect the 
ratio of number of items changed to amount 
of score change (550/476) to approach 1.15. 
From Table 4, which presents the values for 
this ratio obtained in the four samples of this 
study, we see that all four groups deviate 
markedly from this expected value. Even 
the normals show a greater excess of item 
changes to score change than would result if 
the distribution of changes among scored and 


TABLE 4 


MEANS AND STANDARD Deviations For RATIO OF 
Numser oF Items CHANGED TO 
Amount oF Score CHANGE t 








MEAN 








F Vaues 
PN 








1.71 


3.85° 








PN PS-S 





UWC 
UWP 
PN 


.572 
327 


-925 
-034 





+For comparable table, see Table 10 in original 


monograph (3). 
* p=.05. 


unscored items was random. The larger ratio 
shown by all groups indicates a general tend- 
ency for response changes to occur with 
greater frequency among the unscored than 
among the scored items. However, since the 
patient groups, which previously have been 
shown to be characterized by a greater num- 
ber of response changes than the controls, 
show a smaller ratio of item change to score 
change, it is demonstrated that these patients 
make a proportionately greater number of 
changes in responses to scored items than to 
unscored items. This same pattern of greater 
concentration of changes 01 items which 
contribute to the clinical scores was found 
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also for the female patients studied pre- 
viously. As Table 4 shows, while the ratios 
indicate the sensitivity of the scored items as 
indicators of changes having psychiatric im- 
port, the differences among the four groups 
in the mean values of this ratio are not sig- 
nificant although they parallel the intergroup 
relationships which obtained in the earlier 
study. 

When significance tests were applied to the 
differences between the mean item-chan :e/ 
score-change ratios of comparable male and 
female groups, no reliable differences br .ween 
the sexes were found. The standa’ 1 devi- 
ation of the ratio for female nor: ials was 
reliably greater than that for the male 
controls. 


Response Variability by Item Content 


The items of the MMPI were originally 
classified into 25 categories of content for 
descriptive purposes. Examples of these cate- 
gories as given in the original paper describ- 
ing the instrument are: I. General Health; 
II. General Neurologic; III. Cranial Nerves; 
IV. Motility and Coordination; ....XV. Re- 
ligious Attitudes; XVI. Political Attitudes; 
XVII. Social Attitudes, etc. (1). Using these 
categories, it was possible to determine the 
frequency of response changes to the items 
in each of the content categories for each of 
the samples. This makes possible compari- 
sons of the content patterning of response 
changes among the samples of this study 
and comparison of the patterning of changes 
for these samples with comparable female 
samples.® Frequency of response changes per 
content category was not analyzed for the 
PN group because of the small size of the 
sample. 

Table 5 reports the rank-order correlations 
among three of the four male samples for the 
frequency of response change by content cate- 
gories. These correlations are all positive but 
low and of a magnitude to indicate consider- 
able dissimilarity of the groups in the relative 
concentrations of response changes among the 
25 categories of item content. For all three 
samples, the categories of Morale, Occupa- 


® A table reporting the per cent of items changed for 
each content category for the UWC, UWP, and PS-S 
groups and the rank order of these categories in each 
group for per cent of items changed is available from 
the American Documentation Institute. See footnote 3. 
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TABLE 5 


Rank-Orpver CORRELATION COEFFICIENTS FoR Fre- 
quency or Item Response CHANGES PER CoNn- 
TENT CaTEGoRY FOR THREE SAMPLES OF MALEs 








UWP PS-S 





-109 
+373 


-508 





tional, and General Health items ranked 
among the first six categories for frequency 
of item response change. By contrast, the 
category of Cardiorespiratory items which 
ranked second for frequency of response 
change in the controls (UWC) ranked 11 
and 25 in the UWP and PS-S groups re- 
spectively. Category IV, Motility and Co- 
ordination, which ranked in 19th place for 
frequency of change among the UWC cases, 
ranked 4.5 and 3.5 in the UWP and PS-S 
groups respectively. These findings suggest 
that there are certain categories of items, in 
terms of content, which elicit stable responses 
regardless of the Ss or treatments considered 
in a test-retest study, whereas other cate- 


gories of items demonstrate much higher fre- 
quencies of response change and differential 


frequencies for various subject-treatment 
groupings. In this regard, it is pertinent to 
note that the range in frequency of response 
change per category of item was 9.0 per cent 
to 27.2 per cent in the UWC group, 11.8 per 
cent to 35.4 per cent in the UWP sample, and 
21.0 per cent to 34.9 per cent in the PS-S 
group. 

Table 6 indicates the amount of similarity 
in the distribution of response changes per 
content category for comparable samples of 
male and female Ss in terms of rank-order 


TABLE 6 


RANK-Orper CorRRELATION COEFFICIENTS FOR Fre- 
quency oF Item Response CHANGES PER 
ConTENT CaTEGoRY FOR COMPARABLE 
Ma.e-FemMate SAMPLES 








Maes 


UWP 





FEMALES 





Normals 
OPD 
PS-S 


-616 





correlation coefficients. It is obvious that the 
relative concentration of response changes 
over the 25 categories is far from identical in 
comparable samples of unlike sex, but it 
appears that there is greater similarity for 
this variable among males and females of the 
same diagnostic status than among different 
diagnostic groups of the same sex.. In other 
words, the tendency of certain types of items, 
in terms of content, to elicit relatively stable 
or relatively variable responses is less a func- 
tion of the sex of the respondents than of 
other variables such as psychiatric status and 
form of treatment. 


Stable vs. Variable ltems 


Since the variability of the items (percent- 
age of responses per item changed from first 
to second testing) shows a large range in 
each of the groups (e.g., 0 to 51.8 per cent in 
the UWC group), it was decided to deter- 
mine what differences might characterize the 
most stable and most variable items in each 
sample. A table was prepared to show the 
percentage distribution by scales of the scores 
yielded by the most stable items (not over 
5 per cent change, and mostly o per cent 
change) and an equal number of the most 
variable items (minimal change of 37 per 
cent) for the UWC, UWP, and PS-S 
groups.’ The data for this table were then 
compared with those of the comparable table 
in the earlier study of females. For the nor- 
mal Ss, both sexes proved alike in showing a 
marked concentration of the stable items on 
the F and Se scales. The male and female 
outpatient samples were also similar in regard 
to the concentration of stable items on the 
F and Se scales. In the female groups, no 
consistent scale representations of the most 
variable items had been found. This same 
lack of consistent patterning of scale appear- 
ance of variable items was true also for the 
male samples of this investigation. However, 
in the UWP sample, the P¢ scale had a clear 
majority of the variable items, and in the 
PS-S group, the Pt scale was a close second 
to the Pd scale for concentration of variable 
items. Because of the stability of responses 
elicited by many items scored on the F and 
Se scales, marked changes in scores on these 


™ The complete table is .vailable from the American 
Documentation Institute. See footnote 3. 
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scales should stimulate a check for confusion, 
illiteracy, non-cooperativeness, or other fac- 
tors in the § making for invalidity of the 
MMPI record. In the absence of such factors, 
changes on these scales probably reflect valid 
and relatively stable personality changes 
rather than chance fluctuations or minor 
mood and attitudinal variation. 

Table 7 reports the percentage of the most 
stable and most variable items within each 
sample which were “obvious,” “subtle,” or 


not scored on any scale. The “obvious” and 
“subtle” items are composed of all items 
scored on the D, Hy, Pd, Pa, and Ma scales 
of the MMPI and have been keyed by Wiener 
and Harmon (3) according to the relative 
ease-or difficulty with which the nature of the 
socially acceptable response may be detected. 


on MMPI Responses 5 


critical ratios sufficiently large to result in 
their selection for scoring on any of the exist- 
ing MMPI scales” (3, p. 22; cf. Discussion 
of determinants of selection of items for 
scoring). 

Still further analysis of the stable and 
variable items was made to determine other 
possibly differentiating variables. The pro- 
portion of each class of item (stable and 
variable), in each group, involving frequency 
estimates (e.g., I feel weak all over much of 
the time) was determined. Both the stable 
and variable items showed clear predomi- 
nance of items without frequency estimate in 
all samples studied in this analysis (UWC, 
UWP, and PS-S), but the relative predomi- 
nance of “nonfrequency” to “frequency” 
items was much greater in the stable items 


TABLE 7 


Proportion oF Most StasLte AND Most Vartaste Items Wuicn Are “Osvious,” “Sustie,” anp Nor 
Scorep on Any Scate * 











Most Stasie ITems 


Most Varias_e Items 








No. oF PerCent PerCent Per Cent 
IrTEMs Osvious Sustte Nor Scorep Irems 


Per Cent 
Nor Scorep 


No. oF Pex Cent Per Cent 
Osvious SuBTLE 





UWC 37 40.5 ° 18.9 
UWP 20 50.0 ° 15.0 
PS-S 19 36.8 5.3 15.8 


39 
19 
26 





* For comparable table, see Table 12 in original monograph (3). 


From Table 7 it is seen that the stable items 
are predominantly of the obvious type. Less 
than a fifth of the stable items are unscored. 
By contrast, the variable items show no clear 
predominance of either obvious or subtle con- 
tent. Only the controls (UWC) show more 
subtle than obvious items among the variable 
items. In the other groups, the variable items 
are more frequently obvious than subtle, but 
the predominance of obvious items is not 
great. These findings exactly parallel those 
obtained with female Ss in the previous study. 
Another parallel finding demonstrated in 
Table 7 is the considerably greater frequency 
with which variable items prove to be un- 
scored, as contrasted with stable items. “In 
short, the variable items ... are character- 
ized by a tendency to be either of a type in 
which the direction of the socially desirable 
response is not readily perceptible or of a 
type which has not yielded normal-criterion 


for two of the three groups.* The average 
ratio of the number of “nonfrequeacy” to 
the number of “frequency” items among the 
stable items was 36.65 in this study; the com- 
parable average ratio for the variable items 
was only 1.64. Comparable ratios obtained 
from the female samples of the earlier study 
were 5.13 and 1.86 respectively. It is clearly 
more characteristic of stable items than of 
variable items not to involve a frequency 
estimate by the respondent, regardless of sex. 

When a study was made of the relationship 
between stability or variability of responses 
to items and the tense in which they were 
stated, present or past, it was found that 
variable items were characterized by a 
greater preponderance of “present tense” than 
were stable items. This finding is generally 

8A table reporting the basic data of this analysis is 


available from the American Documentation Institute. 
See footnote 3. 
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similar to that which obtained with the 
female Ss. It is logical that items involving 
matters of history should elicit more stable 
responses than those dealing with present 
moods and situations. However, the ratios 
of number of present-tense to number of 
past-tense items in the UWC group were 5.17 
and 6.80 for the stable and variable items 
respectively, suggesting that the relationship 
between tense of item and relative stability of 
response is by no means a simple one.” 

Of the total of 160 stable and variable items 
yielded by the UWC, UWP, and PS-S groups, 
only nine (5.6 per cent) were of a double- 
negative variety, i.¢., items which are stated 
in the negative and scored if answered 
“False” (e.g., I do not tire quickly). Eight 
of these nine items were from the variable 
group. A similar preponderance of double 
negatives among the variable items was 
found in the female study. 


SUMMARY AND CONCLUSIONS 


This study represents an attempt to analyze 
the effects of an interval of time and of cer- 
tain therapeutic programs on the responses to 
the MMPI of selected groups of Ss. It is con- 
cerned not only with the manner in which 
MMPI scale scores reflect the operation of 
therapeutic endeavors but also with a detailed 
study of the relationship between item re- 
sponses and therapy variables. Most specifi- 
cally, this study has sought to determine if 
the findings of a previous investigation with 
female Ss would be duplicated when com- 
parable analyses were applied to data from 
males. The major conclusions of this study 
are as follows. 

t. In general, the test and retest MMPI 
records of male normals and of male patients 
undergoing specified therapies show the same 
changes in scale scores and in item responses 
that characterize the records of comparable 
female Ss similarly analyzed. 

2. The MMPI profiles of successfully 
treated psychiatric patients reflect the success 
of therapy in significantly reduced scale 
scores. Where the success of therapy is less 
definite and where there is reason to sur- 
mise lack of improvement because of the 

® Ratios of number of present-tense to past-tense items 
among stable and variable items for the other groups are 


available in a table from the American Documentation 
Institute. See footnote 3. 


nature and extent of therapy, the age and 
intelligence of the patients, and other fas- 
tors, the MMPI scale scores remain essentially 
unchanged. 

3. Psychiatric patients retested following an 
interval of therapy change a larger number of 
item responses than do normals. Neurotics, 
whether treated as outpatients or hospitalized, 
show a similar frequency of response change 
which is median to the frequency of change 
found in normals at one extreme and that 
shown by psychotics treated with ECT at the 
other extreme. Intersubject variability in 
number of responses changed is significantly 
greater in all patient groups than in normals. 

4. There appears to be a tendency for 
males, norma!s or patients, to change a larger 
number of item responses on retest than 
do females. The sex difference was reliable 
in the case of the normal and outpatient 
comparisons. 

5. Normals and patients alike show a tend- 
ency to a greater frequency of change in re- 
sponse to unscored items than to items which 
are scored on any of the MMPI scales. For 
patients, however, the ratio of number of item 
responses changed to amount of change in 
scale score is smaller than that for normals. 
This finding, coupled with that of greater fre- 
quency of response change in patients, argues 
for a special sensitivity of the scored MMPI 
items as indicators of psychiatrically pertinent 
changes in the respondent. The two sexes 
are similar in this function. 

6. The tendency of items in terms of 
their content (e.g., general health, neurologic, 
morale, etc.) to elicit responses which prove 
relatively stable or relatively variable is less a 
function of the sex of the respondents and 
more a function of psychiatric status and type 
of therapy applied. 

7. An analysis was made to determine the 
characteristics of the most stable and the most 
variable items in terms of the responses of 
the different subject groups. It was found 
that the stable items (those eliciting the same 
response from test to retest) tended to be 
concentrated on the F and Se scales of the 
MMPI, to be obvious in nature, to involve no 
estimate by the respondent of the frequency 
of occurrence of a given behavior, and to be 
more frequently stated in the past rather than 
in the present tense. Variable items, by 
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contrast, show no clear concentrations on 
any of the MMPI scales, tend to be subtle 
rather than obvious, to involve estimates of 
frequency of given behavior, and to be stated 
in the present rather than the past tense. 
These findings are essentially identical to 
those obtained with female Ss. 

8. In essence, the findings from these 
analyses strengthen the conclusion of the 
earlier study as to the value of the MMPI as 
an objective technique for the evaluation of 
psychotherapeutic endeavors. 

Knowledge of these relationships between 
item structure and response stability should 
prove useful in the future construction of 
inventories having specialized purposes. 
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EFFECTS OF FEAR-AROUSING COMMUNICATIONS 


IRVING L. JANIS AND SEYMOUR FESHBACH 
Yale University 


and attitudes are modified, learning proc- 

esses are involved in which motivational 
factors play a primary role. Symbols in mass 
communications can be manipulated in a 
variety of ways so as to arouse socially 
acquired motives such as need for achieve- 
ment, group conformity, power-seeking, and 
the more emotion-laden drives arising from 
aggression, sympathy, guilt, and anxiety. 

The present experiment was designed to 
study the effects of one particular type of 
motive-incentive variable in persuasive com- 
munications, namely, the arousal of fear or 
anxiety by depicting potential dangers to 
which the audience might be exposed.’ Fear 
appeals of this sort are frequently used to 
influence attitudes and behavior. For ex- 
ample, medical authorities sometimes try to 
persuade people to visit cancer detection 
clinics by pointing to the dangerous conse- 
quences of failing to detect the early symp- 
toms of cancer; various political groups play 
up the threat of war or totalitarianism in an 
attempt to motivate adherence to their politi- 
cal program. Our interest in such attempts 
is primarily that of determining the condi- 
tions under which the arousal of fear is effec- 
tive or ineffective in eliciting changes in 
beliefs, practices, and attitudes. 

Implicit in the use of fear appeals is the 
assumption that when emotional tension is 
aroused, the audience will become more 
highly motivated to accept the reassuring 
beliefs or recommendations advocated by the 


I is generally recognized that when beliefs 


1 This study was conducted at Yale University as 
part of a coordinated program of research on attitude 
and opinion change, financed by a grant from the 
Rockefeller Foundation. The attitude change research 
project is under the general direction of Professor Carl 
I. Hovland, to whom the authors wish to express their 
appreciation for many valuable suggestions concerning 
the design of the experiment. Special thanks are due 
to Dr. Isador Hirschfeld of New York City and Dr. 
Bert G. Anderson of the Yale Medical School for their 
helpful advice in connection with the preparation of the 
illustrated talks on dental hygiene. The authors also 
wish to thank Dr. S. Willard Price, Superintendent of 
Schools at Greenwich, Connecticut, and Mr. Andrew 
Bella, Principal of the Greenwich High School, for their 
generous cooperation. 


communicator. But the tendency to accept 
reassuring ideas about ways and means of 
warding off anticipated danger may not 
always be the dominant reaction to a fear- 
arousing communication. Under certain con- 
ditions, other types of defensive reactions may 
occur which could give rise to highly unde- 
sirable effects from the standpoint of the 
communicator. 

Clinical studies based on patients’ reactions 
to psychiatric treatment call attention to three 
main types of emotional interference which 
can prevent a person from being influenced 
by verbal communications which deal with 
anxiety-arousing topics. 

1. When a communication touches off 
intense feelings of anxiety, communicatees 
will sometimes fail to pay attention to what 
is being said. Inattentiveness may be a 
motivated effort to avoid thoughts which 
evoke incipient feelings of anxiety. This 
defensive tendency may be manifested by 
overt attempts to change the subject of con- 
versation to a less disturbing topic. When 
such attempts fail and anxiety mounts to a 
very high level, attention disturbances may 
become much more severe, ¢.g., “inability 
to concentrate,” “distractibility,” or other 
symptoms of the cognitive disorganization 
temporarily produced by high emotional 
tension (4). 

2. When exposed to an anxiety-arousing 
communication, communicatees will occa- 
sionally react to the unpleasant (“punishing”) 
experience by becoming aggressive toward 
the communicator. If the communicator is 
perceived as being responsible for producing 
painful feelings, aggression is likely to take 
the form of rejecting his statements. 

3. If a communication succeeds in arousing 
intense anxiety and if the communicatee’s 
emotional tension is not readily reduced 
either by the reassurances contained in the 
communication or by self-delivered reassur- 
ances, the residual emotional tension may 
motivate defensive avoidances, i.e., attempts 
to ward off subsequent exposures to the 





Errects oF i Sas-ArousInc CoMMUNICATIONS 79 


anxiety-arousing content. The experience of 
being temporarily unable to terminate the 
disturbing affective state elicited by a discus- 
sion of a potential threat can give rise to a 
powerful incentive to avoid thinking or hear- 
ing about it again; this may ultimately result 
in failing to recall what the communicator 
said, losing interest in the topic, denying or 
minimizing the importance of the threat. 

The above reaction tendencies, while 
formulated in general terms, take account 
of three specific types of behavior observed 
during psychoanalytic or psychotherapeutic 
sessions (1, 2, 3). The first two refer to 
immediate reactions that often occur when 
a therapist gives an interpretation which 
brings anxiety-laden thoughts or motives into 
the patient’s focus of awareness: (a) ‘atten- 
tion disturbances, blocking of associations, 
mishearing, evasiveness, and similar forms 
of “resistance”; and (4) argumentativeness, 
defiance, contempt, and other manifestations 
of reactive hostility directed toward the 
therapist. The third refers to certain types 
of subsequent “resistance,” displayed during 
the later course of treatment, as a carry-over 
effect of the therapist’s disturbing comments 
or interpretations. 

Although the three types of defensive be- 
havior have been observed primarily in clini- 
cal studies of psychoneurotic patients (whose 
anxiety reactions are generally linked with 
unconscious conflicts), it seems probable that 
similar reactions may occur among normal 
persons during or after exposure to communi- 
cations which make them acutely aware of 
severe threats of external danger. Neverthe- 
less, it remains an open question whether 
such sources of emotional interference play 
any significant role in determining the net 
effectiveness of fear-arousing material in mass 
communications, especially when the com- 
munications are presented in an impersonal 
social setting where emotional responses of 
the audience are likely to be greatly 
attenuated. 

The present experiment was designed to 
investigate the consequences of using fear 
appeals in persuasive communications that 
are presented in an impersonal group situ- 
ation. One of the main purposes was to 
explore the potentially adverse effects which 
might result from defensive reactions of the 


sort previously noted in the more restricted 
situation of psychotherapy. 


METHOD 


The experiment was designed so as to provide 
measures of the effects of three different intensities 
of “fear appeal” in a standard communication on 
dental hygiene, presented to high school students. 
The influence of the fear-arousing material was 
investigated by means of a series of questionnaires 
which provided data on emotional reactions to the 
communication and on changes in dental hygiene 
beliefs, practices, and attitudes. 


The Three Forms of Communication 


A 15-minute illustrated lecture was prepared in 
three different forms, all of which contained the 
same essential information about causes of tooth 
decay and the same series of recommendations 
concerning oral hygiene practices. The three (re 
corded) lectures were of approximately equal length 
and were delivered in a standard manner by the 
same speaker. Each recording was supplemented 
by about 20 slides, which were shown on the screen 
in a prearranged sequence, to illustrate various 
points made by the speaker. 

The three forms of the illustrated talk differed 
only with respect to the amount of fear-arousing 
material presented. Form 1 contained a strong fear 
appeal, emphasizing the painful consequences of 
tooth decay, diseased gums, and other dangers that 
can result from improper dental hygiene. Form 2 
presented a moderate appeal in which the dangers 
were described in a milder and more factual man- 
ner. Form 3 presented a minimal appeal which 
rarely alluded to the consequences of tooth neglect. 
In Form 3, most of the fear-arousing material was 
replaced by relatively neutral information dealing 
with the growth and functions of the teeth. In all 
other respects, however, Forin 3 was identical with 
Forms 1 and 2. 

The fear appeals were designed to represent typi- 
cal characteristics of mass communications which 
attempt to stimulate emotional reactions in order to 
motivate the audience to conform to a set of 
recommendations. The main technique was that 
of calling attention to the potential dangers that 
can ensue from nonconformity. For example, the 
Strong appeal contained such statements as the 
following: 

If you ever develop an infection of this kind from 
improper care of your teeth, it will be an extremely 
serious matter because these infections are really 
dangerous. They can spread to your eyes, or your 
heart, or your joints and cause secondary infections 
which may lead to diseases such as arthritic paralysis, 
kidney damage, or total blindness. 


One of the main characteristics of the Strong 
appeal was the use of personalized threat-references 
explicitly directed to the audience, i.e., statements to 
the effect that “this can happen to you.” The 
Moderate appeal, on the other hand, described the 
dangerous consequences of improper oral hygiene 
in a more factual way, using impersonal language. 
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In the Minimal appeal, the limited discussion of 
unfavorable consequences also used a purely factual 
style. 

The major differences in content are summarized 
in Table 1, which is based on a systematic content 
analysis of the three recorded lectures. The data 
in this table show how often each type of “threat” 
was mentioned. It is apparent that the main differ- 
ence between the Strong appeal and the Moderate 
appeal was not so much in the total frequency of 
threat references as in the variety and types of 
threats that were emphasized. The Minimal 
appeal, however, differed markedly from the other 
two in that it contained relatively few threat refer- 
ences, almost all of which were restricted to 
“cavities” or “tooth decay.” 


TABLE | 


Content ANALYsIS OF THE THREE FoRMS OF THE 
COMMUNICATION: REFERENCES TO CoNSsE- 
QUENCES oF IMPROPER CARE OF 
THE TEETH 








Tyre Form1 Form 2 Form 3 
OF (Stronoc (Moperate (MINIMAL 
REFERENCE AppeaL) AppeEAL) APPEAL) 





Pain from toothaches 11 1 ° 
Cancer, paralysis, blindness 

or other secondary dis- 

cases 
Having teeth pulled, cavities 

drilled, or other painful 

dental work 
Having cavities filled or 

having to go to the 

dentist ° 5 
Mouth infections: sore, 

swollen, inflamed gums 18 16 
Ugly or discolored teeth 4 2 
“Decayed” teeth 14 12 
“Cavities” 9 12 
Total references to unfavor- 

able consequences 71 49 





One of the reasons for selecting dental hygiene 
as a suitable topic for investigating the influence of 
fear appeals was precisely because discussions of this 
topic readily lend themselves to quantitative and 
qualitative variations of the sort shown in Table tr. 
Moreover, because of the nature of the potential 
dangers that are referred to, one could reasonably 
expect the audience to be fairly responsive to such 
variations in content—the teeth and gums probably 
represent an important component in the average 
person’s body image, and, according to psycho- 
analytic observations, the threat of damage to the 
tecth and gums can sometimes evoke deep-seated 
anxieties concerning body integrity. In any case, 
by playing up the threat of pain, disease, and body 
damage, the material introduced in Form 1 is prob- 
ably representative of the more extreme forms of 
fear appeals currently to be found in persuasive 
communications presented via the press, radio, tele- 
vision, and other mass media. 


The fear appeals did not rely exclusively upon 
verbal material to convey the threatening conse- 
quences of nonconformity. In Form 1, the slides 
used to illustrate the lecture included a series of 
eleven highly realistic photographs which vividly 
portrayed tooth decay and mouth infections. 
Form 2, the Moderate appeal, included nine photo- 
graphs which were milder examples of oral pathol- 
ogy than those used in Form 1. In Form 3, 
however, no realistic photographs of this kind were 
presented: X-ray pictures, diagrams of cavities, and 
photographs of completely healthy teeth were sub- 
stituted for the photographs of oral pathology. 


Subjects 


The entire freshman class of a large Connecticut 
high school was divided into four groups on a 
random basis. Each of the three forms of the com- 
munication was given to a separate experimental 
group; the fourth group was used as a control group 
and was exposed to a similar communication on a 
completely different topic (the structure and func- 
tioning of the human eye). Altogether there were 
200 students in the experiment, with 50 in each 
group. 

The four groups were well equated with respect 
to age, sex, educational level, and IQ. The mean 
age for each group was approximately 15 years and 
there were roughly equal numbers of boys and girls 
in each group. The mean and standard deviation 
of IQ scores, as measured by the Otis group test, 
were almost identical in all four groups. 


Administration of the Questionnaires 


The first questionnaire, given one week before 
the communication, was represented to the students 
as a general health survey of high school students. 
The key questions dealing with dental hygiene were 
interspersed among questions dealing with many 
other aspects of health and hygiene. 

One week later the illustrated talks were given as 
part of the school’s hygiene program. Immediately 
after the end of the communication, the students 
in each group were asked to fill out a short ques- 
tionnaire designed to provide data on immediate 
effects of the communication, such as the amount 
of information acquired, attitudes toward the com- 
munication, and emotional reactions. A follow-up 
questionnaire was given one week later in order to 
ascertain the carry-over effects of the different forms 
of the communication. 


REsuULTs 


Affective Reactions 


Evidence that the three forms of the illus- 
trated talk differed with respect to “te 
amount of emotional tension evoked during 
the communication is presented in Table 2. 
Immediately after exposure to the communi- 
cation, the students were asked three ques- 
tions concerning the feelings they had just 
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TABLE 2 


Feetincs of Worry or Concern Evokeo Durinc THE COMMUNICATION 








QUESTIONNAIRE RESPONSES 


Moperate MINIMAL 
Group Group 
(N=50) (N=50) 


STRONG 
Group 
(N=50) 





Felt worried—a “few times” or “many times”—about own mouth condition 
Felt “somewhat” or “very’’ worried about improper care of own tecth 
Thought about condition of own teeth “most of the time” 


60% 
36% 
34% 


48% 
34% 
22% 


74% 
66% 








experienced “while the illustrated talk was 
being given.” ‘Their responses indicate that 
the fear stimuli were successful in arousing 
affective reactions. On each of the three 
questionnaire items shown in the table, the 
difference between the Strong group and the 
Minimal group is reliable at beyond the .05 
confidence level.2 The Moderate group con- 
sistently falls in an intermediate position but 
does not, in most instances, differ significantly 
from the other two groups. 

Further evidence of the effectiveness of the 
fear-arousing material was obtained from 
responses to the following two questions, 
each of which had a checklist of five answer 
categories ranging from “Very worried” to 
“Not at all worried”: 

1. When you think about the possibility 


2 All probability values reported in this paper are 
based on one tail of the theoretical distribution, since 
the results were used to test specific hypotheses which 
predict the direction of the differences. 


that you might develop diseased gums, how 
concerned or worried do you feel about it? 
2. When you think about the possibility 
that you might developed decayed teeth, how 
concerned or worried do you feel about it? 
Since these questions made no reference to 
the illustrated talk, it was feasible to include 
them in the pre- and postcommunication 
questionnaires given to all four groups. 
Systematic comparisons were made in 
terms of the percentage in each group who 
reported relatively high disturbance (i.e., 
“somewhat” or “very worried”) in response 
to both questions. The results, presented in 
Table .3, show a marked increase in affective 
disturbance among each of the three experi- 
mental groups, as compared with the control 


group. Paralleling the results in Table 2, the 
greatest increase is found in the Strong 
group. The difference between the Moderate 


and the Minimal groups, however, is 


insignificant. 


TABLE 3 


PerRcENTAGE oF Each Group Wuo Reportep Feetinc SomewHaT orn Very Worriep asout Decayep Test 
AND Diseasep Gums 








STRONG 
Group 
(N=50) 


ConTROL 
Group 
(N=50) 


MINIMAL 
Group 
(N=50) 


MopeERATE 
Group 
(N=50) 





One week before the communication 
Immediately after the communication 
Change 


34 24 22 30 
76 
+42% 


50 46 38 
+26% +24% +8% 





Group 


Reviasicity or Dirrerence * 
CR p 





Strong vs. Control 
Strong vs. Minimal 
Strong vs. Moderate 
Moderate vs. Control 
Moderate vs. Minimal 
Minimal vs. Control 


.06 <.o1 
+59 -06 
+37 -09 
+54 -06 
-17 +43 
+43 -08 





* The statistical test used was the critical ratio for reliability of differences in amount of change between 
two independent samples, as described by Hovland, Lumsdaine, and Sheffield (5, p. 321). 
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In order to obtain an over-all estimate of 
the relative degree of emotional arousal 
evoked by the three forms of the communi- 
cation, a total score was computed for each 
individual in each experimental group, based 
on answers to all five questions: two points 
credit was given to each response specified 
in Tables 2 and 3 as indicative of high dis- 
turbance; one point credit was given to 
intermediate responses on the checklist; zero 
credit was given for the last two response 
categories in each check list, which uniformly 
designated a relative absence of worry or 
concern. Hence individual scores ranged 
from zero to ten. The mean scores for the 
Strong, Moderate and Minimal groups were 
78, 6.6, and 5.9 respectively. The Strong 
group differs reliably at the one per cent 
confidence level from each of the other two 
groups (#==2.3 and 3.6). The difference be- 
tween the Moderate and Minimal groups 
approaches reliability at the .o8 confidence 
level (t—=1.4). 

In general, the foregoing evidence indicates 
that after exposure to the communications, 
the Strong group felt more worried about the 
condition of their teeth than did the other 
two groups; the Moderate group, in turn, 
tended to feel more worried than the Mini- 
mal group. 


Information Acquired 


Immediately after exposure to the illus- 
trated talk, each experimental group was 
given an information test consisting of 23 
separate items. The test was based on the 
factual assertions common to all three forms 
of the communication, including topics such 
as the anatomical structure of the teeth, the 
causes of cavities and of gum disease, the 
“correct” technique of toothbrushing, and 
the type of toothbrush recommended by 
dental authorities. No significant differences 
were found among the three experimental 
groups with respect to information test 
scores. Comparisons with the Control group 
show that the three forms of the dental 
hygiene communication were equally effec- 
tive in teaching the factual material. 


Attitude Toward the Communication 


The questionnaire given immediately after 
exposure to the illustrated talk included a 
series of seven items concerning the students’ 


appraisals of the communication. From the 
results shown in Table 4, it is apparent that 
the Strong group responded more favorably 
than the other two groups. 

These findings imply that interest in the 
communication and acceptance of its educa- 
tional value were heightened by the Strong 
appeal. But this conclusion applies only to 
relatively impersonal, objective ratings of the 
communication. Additional evidence pre- 
sented in Table 5, based on questions which 
elicited evaluations of a more subjective char- 
acter, reveals a markedly different attitude 
toward the communication among those ex- 
posed to the Strong appeal. 

One of the additional questions was the 
following: “Was there anyihung in the illus- 
trated talk on dental hygiene that you dis- 
liked?” Unfavorable (“dislike”) answers 
were given by a reliably higher percentage of 
students in the Strong group than in the 
Moderate or Minimal groups (first row of 
Table 5). A tabulation was also made of 
the total number of students in each group 
who gave complaints in their answers to 
either of two open-end questions which asked 
for criticisms of the illustrated talk. The 
results on complaints about the unpleasant 
character of the slides are shown in row two 
of Table 5; the difference between the Strong 
group and each of the other two groups is 
reliable at the .o1 confidence level. Similarly, 
a reliably higher percentage of the Strong 
group complained about insufficient material 
on ways and means of preventing tooth and 
gum disease (row three of Table 5). The 
latter type of criticism often was accompanied 
by the suggestion that some of the disturbing 
material should be eliminated, as is illus- 
trated by the following comments from two 


%The Strong group differs significantly (p<.o5) 
from the Minimal group on five of the seven items 
and from, the Moderate group on three items; the 
Moderate ‘group does not differ reliably from the 
Minimal group on any of the items. 

In row three of Table 5, the difference between the 
Strong and Moderate groups is reliable at the .o1 
confidence level, and the difference between the Strong 
and Minimal groups is significant at the .08 level. 
Other types of criticisms, in addition to those shown 
in Table 5, were also tabulated. Most of these involved 
minor aspects of the presentation (¢.g., “a movie would 
have been better than slides”) and were given by 
approximately equal percentages of the three groups. 
The vast majority of students in the Moderate and 
Minimal groups expressed approval of the illustrated 
talk or stated that they had no criticisms. 
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TABLE 4 


PERCENTAGE oF Eaco Group Wuo Expressep Stroncty Favorasite APPRAISALS OF THE COMMUNICATION 





APPRAISAL RESPONSE 


Srronc Moperate MINIMAL 
Group Group Group 
(N=50) (N=50) (N=s50) 





The illustrated talk does a very good teaching job. 
Most or all of it was interesting. 


62 50 40 
80 68 64 


It was very easy to pay attention to what the speaker was saying. 74 36 50 


My mind practically never wandered. 
The slides do a very good job. 
The speaker’s voice was very good. 


The illustrated talk definitely should be given to all Connecticut high schools. "74 


58 46 42 
52 20 22 
66 56 58 
58 70 





students in the Strong group: “Leave out the 
slides that show the rottiness of the teeth and 
have more in about how to brush your teeth”; 
“I don’t think you should have shown so 
many gory pictures without showing more 
to prevent it.” Comments of this sort, to- 
gether with the data presented in Table 5, 
provide additional evidence of residual emo- 
tional tension. They imply that the Strong 
appeal created a need for reassurance which 
persisted after the communication was over, 
despite the fact that the communication 
contained a large number of reassuring 
recommendations. 

The apparent inconsistency between the 
results in Tables 4 and 5 suggests that the 
Strong appeal evoked a more mixed or 
ambivalent attitude toward the communi- 
cation than did the Moderate or Minimal 
appeals. Some of the comments, particularly 
about the slides, help to illuminate the differ- 
entiation between the individual’s objective 
evaluation of the communication and his 
subjective response to it. The following 
illustrative excerpts from the Strong group 
were selected from the answers given to the 
open-end question which asked for criticisms 
and suggestions: 


I did not care for the “gory” illustrations of 
decayed teeth and diseased mouths but I really think 
that it did make me feel sure that I did not want 
this to happen to me. 

Some of the pictures went to the extremes but 
they probably had an effect on most of the people 
who wouldn’t want their teeth to look like that. 


I think it is good because it scares people when 
they see the awful things that can happen. 


Such comments not only attest to the moti- 
vational impact of the Strong appeal, but also 
suggest one of the ways in which the dis- 
crepancy between subjective and objective 
evaluations may have been reconciled. In 
such cases, the ambivalence seems to have 
been resolved by adopting an attitude to the 
effect that “this is disagreeable medicine, but 
it is good for us.” 


Conformity to Dental Hygiene Recommen- 
dations 


The immediate effects of the illustrated 
talks described above show the type of affec- 
tive reactions evoked by the fear-arousing ma- 
terial but provide little information bearing 
directly on attitude changes. The question- 
naire administered one week later, however, 
was designed to measure some of the major 
carry-over effects of fear appeals, particularly 


r 


TABLE 5 


Percentace or Each Group Wuo Expressep CoMPLAINTs ABOUT THE COMMUNICATION 








Type or ComPLaINT 


Srronc Moperare Minima 
Group Group Group 
(N=50) (N=50) (N=50) 





Disliked something in the illustrated talk. 


28 & 


2 
The slides were too unpleasant (“horrible,” “gory,” “disgusting,” etc.). 34 2 ° 
8 


There was not enough material on prevention. 


20 2 
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with respect to changes in dental hygiene 
practices, beliefs, and preferences. The re- 
sults provide an empirical basis for estimating 
the degree to which such communications 
succeed in modifying attitudes. 

Personal practices were investigated by 
asking the students to describe the way they 
were currently brushing their teeth: the type 
of stroke used, the amount of surface area 
cleansed, the amount of force applied, the 
length of time spent on brushing the teeth, 
and the time of day that the teeth were 
brushed. The same five questions were 
asked one week before the communication 


four groups had very low scores and the 
group differences were insignificant. By 
comparing the score that each individual 
attained one week after the communication 
with that attained two weeks earlier, it was 
possible to determine for each group the 
percentage who changed in the direction of 
increased or decreased conformity. 

The results, shown in Table 6, reveal that 
the greatest amount of conformity was pro- 
duced by the communication which con- 
tained the least amount of fear-arousing 
material. The Strong group showed reliably 
less change than the Minimal group; in fact, 


TABLE 6 


Errect oF THE ILLtustraTep TALK ON CoNnFoRMITY TO Dentat Hycienet RECOMMENDATIONS 








STRONG 
Group 
(N=50) 


Type or CHANGE 


ConTROL 
Group 
(N=50) 


MINIMAL 
Group 
(N=50) 


MopERATE 
Group 
(N=50) 





28% 
20% 
52% 


+8% 


Increased conformity 
Decreased conformity 
No change 

Net change in conformity 


22% 
22% 
56% 

o% 


44% 
22% 
34% 
+22% 


50% 
14% 
36% 
+36% 





Group 


ReviaBiLiry oF Dif FERENCE 





Control vs. Minimal 
Control vs. Moderate 
Control vs. Strong 
Strong vs. Moderate 
Strong vs. Minimal 
Moderate vs. Minimal 





and again one week after. These questions 
covered practices about which the following 
specific recommendations were made in all 
three forms of the illustrated talk: (a) the 
teeth should be brushed with an up-and-down 
(vertical) stroke; (4) the inner surface of the 
teeth should be brushed as well as the outer 
surface; (¢c) the teeth should be brushed 
gently, using only a slight amount of force; 
(d) in order to cleanse the teeth adequately, 
one should spend about three minutes on 
each brushing; (¢) in the morning, the teeth 
should be brushed after breakfast (rather 
than before). 

Each student was given a score, ranging 
from zero to five, which represented the 
number of recommended practices on which 
he conformed. Before exposure to the com- 
munication, the majority of students in all 


the Strong group failed to differ significantly 
from the Control group, whereas the Mini- 
mal group showed a highly reliable increase 
in conformity as compared with the Control 
group. The Moderate group falls in an 
intermediate position, but does not differ 
reliably from the Strong or Minimal groups. 
Although there is some ambiguity with re- 
spect to the relative effectiveness of the 
Moderate appeal, the data in Table 6 show a 
fairly consistent trend which suggests that as 
the amount of fear-arousing material is in- 
creased, conformity tends to decrease. In 
contrast to the marked increase in conformity 
produced by the Minimal appeal and the 
fairly sizable increase produced by the Mod- 
erate appeal, the Strong appeal failed to 
achieve any significant effect whatsoever. 
One cannot be certain, of course, that the 
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findings represent changes in overt behavioral 
conformity, since the observations are based 
on the Ss’ own verbal reports, What remains 
problematical, however, is whether the verbal 
responses reflect only “lip-service” io the 
recommendations or whether they also reflect 
internalized attitudes that were actually car- 
ried out in action. The results, neverthe- 
less, demonstrate that the Strong appeal was 
markedly less effective than the Minimal 
appeal, at least with respect to eliciting verbal 
conformity. 

Further evidence in support of the same 
conclusion comes from responses pertinent to 
a different type of dental hygiene behavior 
which had also been recommended in the 
illustrated talk.5 The students were asked to 
give the approximate date on which they had 
last gone to a dentist. The percentage in 
each group whose answers indicated that they 
had gone to the dentist during the week fol- 
lowing exposure to the illustrated talk were 
as follows: 10 per cent of the Strong group, 
14 per cent of the Moderate group, 18 per cent 
of the Minimal group, and 4 per cent of the 
Control group. The percentage difference 
between the Minimal group and the Control 


group was found to be statistically reliable 
at the .o4 confidence level; none of the other 


comparisons yielded reliable differences. 
Although not conclusive evidence, these find- 
ings are in line with those in Table 6: the 
Minimal appeal again appears to have been 
superior with respect to eliciting conformity 
to a recommended practice. 


Beliefs Concerning the “Proper” Type of 
Toothbrush 


The illustrated talk presented an extensive 
discussion of the “proper” type of toothbrush 
recommended by dental authorities. Four 
main characteristics were emphasized: (a) 
the bristles should be of medium hardress, 
(6) the brush should have three rows of 
bristles, (c) the handle should be completely 
straight, and (d) the brushing surface should 
be completely straight. Personal beliefs con- 
cerning the desirability of these four charac- 

5In all three forms of the illustrated talk, an explicit 
recommendation was made concerning the desirability 
of obtaining advice from a dentist about one’s own 
toothbrushing technique. In addition, several references 
were made to the importance of going to a dentist for 


prompt treatment of cavities, before the decay spreads 
to the inner layers of the tooth. 


teristics were measured by four questions 
which were included in the precommuai- 
cation questionnaire as well as in the 
questionnaire given one weck after the com- 
munication. The main finding was that all 
three experimental groups, as compared with 
the Control group, showed a significant 
change in the direction of accepting the con- 
clusions presented in the communication. 
Among the three experimental groups, there 
were no significant differences with respect 
to net changes. Nevertheless, as will be seen 
in the next section, the fear-arousing material 
appears to have had a considerable effect on 
the degree to which the students adhered to 
such beliefs in the face of counteracting 
propaganda. 


Resistance to Counteracting Propaganda 


In addition to describing the four essential 
characteristics of the “proper” toothbrush, 
the illustrated talk contained numerous com- 
ments and illustrations to explain the need 
for avoiding the “wrong” kind of toothbrush. 
Much of the material on cavities and other 
unpleasant consequences of tooth neglect was 
presented in this context. The importance 
of using the proper kind of toothbrush was 
the theme that was most heavily emphasized 
throughout the entire communication. 

The key questionnaire item, designed to 
determine initial attitudes before exposure to 
the communication, was the following: 


Please read the following statement carefully and 
decide whether you believe it is true or false. 


It does not matter what kind of toothbrush a 
person uses. Any sort of toothbrush that is sold in 
a drugstore will keep your teeth clean and healthy— 
if you use it regularly. 


Do you think that this statement is true or false? 
(Check one.) 


One week after exposure to the communi- 
cations, the question was asked again, in 
essentially the same form, with the same 
checklist of five answer categories (ranging 
from “Feel certain that it is true” to “Feel 
certain that it is false”). But in the post- 
communication questionnaire, the question 
was preceded by the following propaganda 
material which contradicted the dominant 
theme of the illustrated talk: 


A well-known dentist recently made the following 
statement: 
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Some dentists, including a number of so-called 
“experts” on dental hygiene, claim it is important 
to use a special type of toothbrush in order to clean 
the teeth properly. But fron: my own experience, 
I believe that there is no sound basis for that ides. 
My honest opinion, as a dentist, is that it does not 
matter what kind of toothbrush a person uses. Any 
sort of toothbrush that is sold in a drugstore will 
keep your teeth clean and healthy—if you use it 
regularly. 


That this propaganda exposure had a pro- 
nounced effect is revealed by the attitude 
changes shown by the Control group. A 
statistically reliable change in the direction 


definite answer emerges from the results in 
Table 7, which shows the percentage of each 
group who changed in the direction of agree- 
ment or disagreement with the counterpropa- 
ganda statement. 

Before exposure to the illustrated talk, the 
group differences were negligible: approxi- 
mately 50 per cent of the students in each of 
the four groups agreed with the statement 
that “it does not matter what kind of tooth- 
brush a person uses.” But two weeks later 
(immediately after exposure to the counter- 


TABLE 7 


Errect or THe ItuustraTrep TALK ON REACTIONS To SUBSEQUENT COUNTERPROPAGANDA: Net PeRcENTAGE 
or Eacn Group Wuo CHancep tn THE DimecTIon oF AGREEING WITH THE STATEMENT THAT 
“Ir Dogs Not Matrer wuat Kinp oF TootusrusH A Person Uses” 








Type or CHANGE 


STRONG MopERATE MINIMAL ConrTROL 
Group Group Group Group 
(N=50) (N=50) (N=50) (N=50) 





More agreement 
Less agreement 
No change 
Net change 
Net effect of exposure to the illustrated talk 


30 28 14 44 

38 42 54 24 

32 30 32 32 

—8 —14 —40 +20 
—28 —34 —60 





Group 


RELIABILITY OF THE DIFFERENCES IN Net CHANGE 
CR P 





Control vs. Minimal 
Control vs. Moderate 
Control vs. Strong 
Strong vs. Moderate 
Strong vs. Minimal 
Moderate vs. Minimal 


- 66 <.001 
.05 .02 
-71 -05 

36 
-03 +02 
-66 .05 





of more agreement with the counterpropa- 
ganda was found in the Control group.° 
How effective were the three forms of the 
illustrated talk in preventing students from 
accepting the propaganda to which they were 
exposed one week later? Did the fear appeals 
augment or diminish the students’ resistance 
to the counteracting propaganda? A fairly 


®In the Control group, the percentage who disagreed 
with the statement dropped from 54 to 34. This change 
proved to be significant at below the .02 confidence 
level, according to the formula described by Hovland, 
Lumsdaine, and Sheffield (5, p. 319). The Control 
group did not show any significant change on other 
questions dealing with dental hygiene beliefs, preferences 
or practices, all of which were presented in the final 
questionnaire before the propaganda material was intro- 
duced. Consequently, it seems fairly safe to conclude 
that the propaganda exposure was responsible for the 
significant change displayed by the Control group. 


propaganda) there were marked and statisti 
cally reliable differences which indicate that 
although all three forms of the illustrated 
talk had some influence, the Minimal appeal 
was most effective in producing resistance to 
the counterpropaganda. Thus, the results 
suggest that under conditions where people 
will be exposed to competing communica- 
tions dealing with the same issues, the use of 
a strong fear appeal will tend to be less effec- 
tive than a minimal appeal in producing 
stable and persistent attitude changes. 
Some clues to mediating processes were 
detected in the students’ responses to an open- 
end question which asked them to “give the 
reason” for their answers to the key attitude 
item on which the results in Table 7 are 
based. A systematic analysis was made of 
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the write-in answers given by those students 
who had disagreed with the counterpropa- 
ganda. In their refutations, some of the 
students made use of material that had been 
presented one week earlier, either by referring 
to the illustrated talk as an authoritative 
source or by citing one of the main arguments 
presented in the illustrated talk. From the 
results presented in the first two rows of 
Table 8, it is apparent that such refutations 
were given more frequently by the Minimal 
group than by the other experimental groups. 
The comparatively low frequency of such 
answers in the Strong and Moderate groups 
was not compensated for by an increase in 
any other type of specific reasons, as indicated 
by the results in the last row of the table." 


were inclined to avoid recalling the content 
of the fear-arousing communication. 


Discussion 


The results in the preceding sections indi- 
cate that the Minimal appeal was the most 
effective form of the communication in that 
it elicited (a) more resistance to subsequent 
counterpropaganda and (4) a higher inci- 
dence of verbal adherence, and perhaps a 
greater degree of behavioral conformity, to a 
set of recommended practices. The absence 
of any significant differences on other indi- 
cators of preferences and beliefs implies that 
the Moderate and Strong appeals had no 
unique positive effects that would compen- 
sate for the observed detrimental effects. 


TABLE 8 


Types or ReruTation Given By StupENTs WHo Di1saGREED WITH THE COUNTERPROPAGANDA 








Type or REFUTATION 


ConTROL 
Group 
(N=18) 


MINIMAL 
Group 
(N=39) 


MopvERATE 
Group 
(N=29) 


STRONG 
Group 
(N=30) 





Explicit reference to the illustrated talk as an authori- 
tative source for the opposite conclusion 

One or more arguments cited that had been presented 
in the illustrated talk 

One or more arguments cited that contradicted the 
content of the illustrated talk 

No answer or no specific reason given 


0% 
28% 


22% 
50% 


7% 14% 18% 
43% 38% 59% 


o% o% o% 
50% 52% 36% 





Although the group differences are not 
uniformly reliable, they reveal a consistent 
trend which suggests an “avoidance” tend- 
ency among the students who had been 
exposed to the fear appeals. Apparently, even 
those who resisted the counterpropaganda 


7On the first type of reason (reference to the illus- 
trated talk), the only difference large enough to approach 
statistical reliability was that between the Minimal 
group and the Control group (p=.o8). On the second 
type of reason (arguments cited from the illustrated 
talk), the difference between the Minimal group and 
the Control group was found to be highly reliable 
(p=.03) while the difference between the Minimal 
and Moderate groups approached statistical reliability 
(p=.08). The Control group differed reliably from 
each of the experimental groups (at beyond the .10 
confidence level) with respect to giving arguments 
which contradicted those contained in the illustrated 
talk (row three of the table). None of the other 
percentage differences in Table 8 were large enough 
to be significant at the .10 confidence level. (In some 
columns, the percentages add up to more than 100 per 
cent because a few students gave more than one type 
of refutation.) 


Thus, the findings consistently indicate that 
inclusion of the fear-arousing material not 
only failed to increase the effectiveness of the 
communication, but actually interfered with 
its over-all success. 

The outcome of the present experiment by 
no means precludes the possibility that, under 
certain conditions, fear appeals may prove to 
be highly successful. For instance, the Strong 
appeal was found to be maximally effective 
in arousing interest and in eliciting a high 
degree of emotional tension. The evocation 
of such reactions might augment the effective- 
ness of mass communications which are 
designed to instigate prompt audience action, 
such as donating money or volunteering to 
perform a group task. But if the communi- 
cation is intended to create more sustained 
preferences or attitudes, the achievement of 
positive effects probably depends upon a 
number of different factors. Our experi- 
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mental results suggest that in the latter case, 
a relatively low degree of fear arousal is 
likely to be the optimal level, that an appeal 
which is too strong will tend to evoke some 
form of interference which reduces the effec- 
tiveness of the communication. The findings 
definitely contradict the assumption that as 
the dosage of fear-arousing stimuli (in a mass 
communication) is increased, the audience 
will become more highly motivated to accept 
the reasssuring recommendations contained 
in the communication. Beneficial motivating 
effects probably occur when a relatively slight 
amount of fear-arousing material is inserted; 
but for communications of the sort used in 
the present experiment, the optimal dosage 
appears to be far below the level of the 
strongest fear appeals that a communicator 
could use if he chose to do so. 

Before examining the implications of the 
findings in more detail, it is necessary to take 
account of the problems of generalizing from 
the findings of the present study. The 
present experiment shows the effects of only 
one type of communication, presented in an 
educational setting to a student audience. 
Until replications are carried out—using other 
media, topics, and fear-eliciting stimuli, in a 
variety of communication settings, with dif- 
ferent audiences, etc.—one cannot be certain 
that the conclusions hold true for other situ- 
ations. The results from a single experiment 
are obviously not sufficient for drawing broad 
generalizations concerning the entire range 
of fear-arousing communications which are 
currently being brought to the focus of public 
attention. Nor can unreplicated results be 
relied upon for extracting dependable rubrics 
that could be applied by educators, editors, 
public relations experts, propagandists, or 
other communication specialists who face the 
practical problems of selecting appropriate 
appeals for motivating mass audiences. 

Nevertheless, the present experiment helps 
to elucidate the potentially unfavorable effects 
that may result from mass communications 
which play up ominous threats, alarming 
contingencies, or signs of impending danger. 
For instance, the findings tend to bear out 
some of the points raised concerning the need 
for careful pretesting and for other cautions 
when warnings about the dangers of atomic 
bombing are presented in civilian defense 


communications that are intended to prepare 
the public for coping with wartime emer- 
gencies (6). Moreover, despite our inability 
to specify the range of communications to 
which our conclusions would apply, we can 
derive tentative inferences that may have im- 
portant theoretical implications with respect 
to the dynamics of “normal” fear reactions. 

We turn now to a central question posed 
by the experimental findings: Why is it that 
the fear-arousing stimuli resulted in less 
adherence to recommended practices and less 
resistance to counterpropaganda? Although 
our experiment cannot give a definitive 
answer, it provides some suggestive leads 
concerning potential sources of emotional 
interference. 

In the introduction, we have described 
three forms of “resistance” frequently ob- 
served in psychotherapy that might also 
occur among normal personalities exposed to 
mass communications which evoke strong 
fear or anxiety: (@) inattentiveness during 
the communication session, (4) rejection of 
the communicator’s statements motivated by 
reactive aggression, and (c) subsequent de- 
fensive avoidance motivated by residual emo- 
tional tension. We shall discuss briefly the 
pertinent findings from the present experi- 
ment with a view to making a preliminary 
assessment of the importance of each of the 
three types of interfering reactions. 

1. Our results provide no evidence that a 
strong fear appeal produces inattentiveness or 
any form of distraction that would interfere 
with learning efficiency during the communi- 
cation session. The three forms of the com- 
munication were found to be equally effective 
in teaching the factual material on dental 
hygiene, as measured by a comprehensive 
information test given immediately after 
exposure to the communication. Beliefs con- 
cerning the desirable characteristics of the 
“proper” type of toothbrush were also 
acquired equa'ly weil. One might even sur- 
mise (from the results in Table 4) that the 
Strong appeal may have had a beneficial effect 
on attention, because a significantly higher 
percentage of the Strong group reported that 
(a) it was very easy to pay attention to what 
the speaker was saying and (4) they experi- 
enced very little “mind-wandering.” 

The absence of any observable reduction of 
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learning efficiency is consistent with numer- 
ous clinical observations which imply that 
normal personalities can ordinarily tolerate 
unpleasant information concerning potential 
threats to the self without manifesting any 
marked impairment of “ego” functions. Our 
findings definitely suggest that the use of 
fear-arousing material of the sort presented 
in the illustrated talks would rarely give rise 
to any interference with the audience’s ability 
to learn the content of the communication. 

It is necessary to bear in mind, however, 
that in the present experiment the communi- 
cation was given to a “captive” classroom 
audience. When people are at home listening 
to the radio, or in any situation where they 
feel free to choose whether or not to termi- 
nate the communication exposure, the use of 
strong emotional appeals might often have 
drastic effects on sustained attention. Con- 
sequently, the tentative generalization con- 
cerning the low probability of inattentiveness 
would be expected to apply primarily to those 
fear-arousing communications which are pre- 
sented under conditions where social norms 
or situational constraints prevent the audience 
from directing attention elsewhere. 

Even with a “captive” audience, it is quite 
possible that under certain extreme conditions 
a strong fear appeal might interfere with 
learning efficiency. For instance, the same 
sort of temporary cognitive impairment that 
is sometimes observed when verbal stimuli 
happen to touch off unconscious personal 
conflicts or emotional “complexes” might also 
occur when a mass communication elicits 
sharp awareness of unexpected danger, par- 
ticularly when the audience immediately 
perceives the threat to be imminent and 
inescapable. Hence, the inferences from our 
experimental findings probably should be 
restricted to fear appeals which deal with 
remote threats or with relatively familiar 
dangers that are perceived to be avoidable. 

2. The fact that the Strong group expressed 
the greatest amount of subjective dislike of 
the illustrated talk and made the most com- 
plaints about its content could be construed 
as suggesting a potentially aggressive attitude. 
But if the aggressive reactions aroused by the 
use of the Strong fear appeal were intense 
enough to motivate rejection of the conclu- 
sions, one would not expect to find this group 


giving the most favorable appraisals of the 
interest value of the illustrated talk, of the 
quality of its presentation, and of its over-all 
educational success. Thus, although the pos- 
sibility of suppressed aggression cannot be 
precluded, it seems unlikely that this factor 
was a major source of emotional interference. 
In drawing this tentative conclusion, how- 
ever, we do not intend to minimize the 
importance of aggression as a potential source 
of interference. In the present experiment, 
the communication was administered as an 
official part of the school’s hygiene program 
and contained recommendations that were 
obviously intended to be beneficial to the 
audience. Under markedly different condi- 
tions, where the auspices and intent of the 
communication are perceived to be less be- 
nign, the audience would probably be less 
disposed to suppress or control aggressive re- 
actions. The low level of verbalized aggres- 
sion observed in the present study, however, 
suggests that in the absence of cues which 
arouse the audience’s suspicions, some factor 
other than reactive hostility may be a much 
more important source of interference. 

3. Subsequent defensive avoidance arising 
from residual emotional tension seems to be 
the most likely explanation of the outcome of 
the present study. We have seen, from the data 
on immediate affective reactions, that the dis- 
turbing feelings which had been aroused 
during the illustrated talk tended to persist 
after the communication had ended, despite 
the reassuring recommendations which had 
been presented. The analysis of complaints 
made by the three experimental groups 
(Table 5) provides additional evidence that 
the need for reassurance persisted primarily 
among the students who had been exposed 
to the Strong appeal. Such findings support 
the following hypothesis: When a mass com- 
munication is designed to influence an audi- 
ence to adopt specific ways and means of 
averting a threat, the use of a strong fear 
appeal, as against a milder one, increases the 
likelihood that the audience will be left in @ 
state of emotional tension which is not fully 
relieved by rehearsing the reassuring recom- 
mendations contained in the communication. 
This hypothesis is compatible with the gen- 
eral assumption that when a person is exposed 
to signs of “threat,” the greater the intensity 
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of the fear reaction evoked, the greater the 
likelihood that his emotional tension will 
persis: fter the external stimulus has 
terminated. 

Whether or aot the above hypothesis is 
correct, the fact remains that “unreduced” 
emotional tension was manifested immedi- 
ately after the communication predominantly 
by the group exposed to the Strong appeal. 
Our findings on subsequent reactions provide 
some suggestive evidence concerning the 
consequences of experiencing this type of 
residual tension. In general, the evidence 
appears to be consistent with the following 
hypothesis: When fear is strongly aroused 
but is not fully relieved by the reassurances 
contained in a mass communication, the 
audience will become motivated to ignore or 
to minimize the importance of the threat. 
This hypothesis could be regarded as a 
special case of the following general propo- 
sition which pertains to the effects of human 
exposure to any fear-producing stimulus: 
other things being equal, the more persistent 
the fear reaction, the greater will be the 
(acquired) motivation to avoid subsequent 
exposures to internal and external cues which 
were present at the time the fear reaction was 
aroused. This proposition is based on the 
postulate that fear is a stimulus-producing 
response which has the functional properties 
of a drive (2, 7).8 

SIn the sphere of human communication, the key 
theoretical assumption could be formulated as follows: 
If rehearsal of the reassuring statements contained in a 
communication fails to alleviate the emotional tension 
elicited by the use of a fear appeal, the audience will 
be motivated to continue trying out other (symbolic or 
overt) responses until one occurs which succeeds in 
reducing fear to a tolerable level. Thus, a strong fear 
appeal which is intended to motivate the audience to 
take account of a realistic threat of danger could have 
the paradoxical effect of motivating the audience to 
ignore the threat or to adopt “magical,” “wishful” or 
other types of reassuring beliefs that are antithetical to 
the communicator’s intentions. Morcover, according to 
the same theoretical assumption, when a communication 
produces a high degree of persistent fear, the audience 
will be motivated to engage in overt escape activities, 
some of which may prove to be incompatible with the 
protective actions recommended by the communicator. 
Unintended effects of this kind can be regarded as 
spontaneous “defensive” reactions which are motivated 
by residual emotional tension. In the present experi- 
ment, it would be expected that, in addition to the 
tendency tc avoid thinking about the threat, other 
defensive reactions would also occur. For example, 
following exposure to the Strong appeal, some of the 
students may have succeeded in alleviating their residus! 


emotional tension through spontancous interpersonal 
communication with fellow students. 


In the context of the present experiment, 
one would predict that the group displaying 
the greatest degree of residual fear would be 
most strongly motivated to ward off those 
internal symbolic cues—such as anticipations 
of the threatening consequences of improper 
dental hygiene—which were salient during 
and immediately after the communication. 
This prediction seems to be fairly well borne 
out by the evidence on carry-over effects, par- 
ticularly by the finding that the greatest 
degree of resistance to the subsequent coun- 
terpropaganda was shown by the group 
which had been least motivated by fear. The 
use of the Strong appeal, as against the 
Minimal one, evidently resulted in less rejec- 
tion of a subsequent communication which 
discounted and contradicted what was said 
in the original communication. In effect, the 
second communication asserted that one 
could ignore the alleged consequences of 
using the wrong type of toothbrush, and, in 
that sense, minimized the dangers which pre- 
viously had been heavily emphasized by the 
fear-arousing communication. 

The results obtained from the students’ 
reports on their dental hygiene practices could 
be interpreted as supporting another predic- 
tion from the same hypothesis. It would be 
expected that those students who changed 
their practices, after having heard and seen 
one of the three forms of the illustrated talk, 
were motivated to do so because they recalled 
some of the verbal material which had been 
given in support of the recommendations, 
most of which referred to the unfavorable 
consequences of continuing to do the 
“wrong” thing. In theoretical terms, one 
might say that their conformity to the recom- 
mendations was mediated by symbolic re- 
sponses which had been learned during the 
communication. The mediating responses 
(anticipations, thoughts, or images) acquired 
from any one of the three forms of the illus- 
trated talk would frequently have, as their 
content, some reference to unpleasant conse- 
quences for the self, and consequently would 
cue off a resolution or an overt action that 
would be accompanied by anticipated success 
in warding off the threat. But defensive 
avoidance of the mediating responses would 
reduce the amount of conformity to whatever 
protective action is recommended by the 
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fear-arousing communication. Hence the 
prediction would be that when rehearsal of 
statements concerning potential danger is 
accompanied by strong emotional tension 
during and after the communication, the 
audience will become motivated to avoid re- 
calling those statements on later occasions 
when appropriate action could ordinarily be 
carried out. An inhibiting motivation of 
this kind acquired from the illustrated talk 
would tend to prevent the students from 
adopting the recommended changes in their 
toothbrushing habits because they would fail 
to think about the unpleasant consequences 
of improper dental hygiene at times when 
they subsequently perform the act of brush- 
ing their teeth. 

Much more direct evidence in support of 
the “defensive avoidance” hypothesis comes 
from the analysis of spontaneous write-in 
answers in which the students explained why 
they disagreed with the counterpropaganda 
(Table 8). Those who had been exposed to 
the least amount of fear-arousing material 
were the ones who were most likely to refer 
to the illustrated talk as an authoritative 
source and to make use of its arguments. 
The relative absence of such references in 
the spontaneous answers given by those who 
had been exposed to the Moderate and 
Strong appeals implies a tendency to avoid 
recalling the content of the fear-arousing 
communication. 

Although the various pieces of evidence 
discussed above seem to fit together, they 
cannot be regarded as a conclusive demon- 
stration of the defensive avoidance hypothe- 
sis. What our findings clearly show is that 
a strong fear appeal can be markedly less 
effective than a minimal appeal, at least under 
the limited conditions represented in our 
experiment. Exactly which conditions and 
which mediating mechanisms are responsible 
for this outcome will remain problematical 
until further investigations are carried out. 
Nevertheless, so far as the present findings 
go, they consistently support the conclusion 
that the use of a strong fear appeal will tend 
to reduce the over-all success of a persuasive 
communication, if it evokes a high degree of 
emotional tension without adequately satisfy- 
ing the need for reassurance. 


SUMMARY AND CONCLUSIONS 


The experiment was designed to investigate 
the effects of persuasive communications 
which attempt to motivate people to conform 
with a set of recommendations by stimulating 
fear reactions. An illustrated lecture on 
dental hygiene was prepared in three differ- 
ent forms, representing three different inten- 
sities of fear appeal: the Strong appeal 
emphasized and graphically illustrated the 
threat of pain, disease, and body damage; the 
Moderate appeal described the same dangers 
in a milder and more factual manner; the 
Minimal appeal rarely referred to the un- 
pleasant consequences of improper dental 
hygiene. Although differing in the amount 
of fear-arousing material presented, the three 
forms of the communication contained the 
same essential information and the same set 
of recommendations. 

Equivalent groups of high school students 
were exposed to the three different forms of 
the communication as part of the school’s 
hygiene program. In addition, the experi- 
ment included an equated control group 
which was not exposed to the dental hygiene 
communication but was given a similar 
communication on an irrelevant topic. Alto- 
gether there were 200 students in the experi- 
ment, with 50 in each group. A questionnaire 
containing a series of items on dental hygiene 
beliefs, practices, and attitudes was adminis- 
tered to all four groups one week before the 
communications were presented. In order to 
observe the changes produced by the illus- 
trated talk, postcommunication question- 
naires were given immediately after exposure 
and again one week later. 

1. The fear appeals were successful in 
arousing affective reactions. Immediately 
after the communication, the group exposed 
to the Strong appeal reported feeling more 
worried about the condition of their teeth 
than did the other groups. The Moderate 
appeal, in turn, evoked a higher incidence of 
“worry” reactions than did the Minimal 
appeal. 

2. The three forms of the illustrated talk 
were equally effective with respect to (a) 
teaching the factual content of the com- 
munication, as assessed by an information 
test, and (4) modifying beliefs concerning 
four specific characteristics of the “proper” 
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type of toothbrush. The evidence indicates 
that the emotional reactions aroused by the 
Strong appeal did not produce inattentiveness 
or reduce learning efficiency. 

3. As compared with the other two forms 
of the communication, the Strong appeal 
evoked a more mixed or ambivalent attitude 
toward the communication. The students 
exposed to the Strong appeal were more 
likely than the others to give favorable ap- 
praisals concerning the interest value and the 
quality of the presentation. Nevertheless, 
they showed the greatest amount of subjective 
dislike of the communication and made more 
complaints about the content. 

4. From an analysis of the changes in each 
individual's reports about his current tooth- 
brushing practices, it was found that the 
greatest amount of conformity to the com- 
municator’s recommendations was produced 
by the Minimal appeal. The Strong appeal 
failed to produce any significant change in 
dental hygiene practices, whereas the Mini- 
mal appeal resulted in a reliable increase in 
conformity, as compared with the Control 
group. Similar findings also emerged from 
an analysis of responses which indicated 


whether the students had gone to a dentist 
during the week following exposure to the 


illustrated talk, reflecting conformity to 
another recommendation made by the com- 
municator. ‘The evidence strongly suggests 
that as the amount of fear-arousing material 
is increased, conformity to recommended 
(protective) actions tends to decrease. 

5. One week after the illustrated talk had 
been presented, exposure to counterpropa- 
ganda (which contradicted the main theme 
of the original communication) produced a 
greater effect on attitudes in the Control 
group than in the three experimental groups. 
The Minimal appeal, however, proved to be 
the most effective form of the illustrated talk 
with respect to producing resistance to the 
counterpropaganda. The results tend to su:p- 
port the conclusion that under conditions 


where people are exposed to competing com- 
munications dealing with the same issues, the 
use of a strong fear appeal is less successful 
than a minimal appeal in producing stable 
and persistent attitude changes. 

6. The main conclusion which emerges 
from the entire set of findings is that the 
over-all effectiveness of a persuasive com- 
munication will tend to be reduced by the 
use of a strong fear appeal, if it evokes a 
high degree of emotional tension without ade- 
quately satisfying the need for reassurance. 
The evidence from the present experiment 
appears to be consistent with the following 
two explanatory hypotheses: 

a. When a mass communication is de- 
signed to influence an audience to adopt 
specific ways and means of averting a threat, 
the use of a strong fear appeal, as against a 
milder one, increases the likelihood that the 
audience will be left in a state of emotional 
tension which is not fully relieved by rehears- 
ing the reassuring recommendations con- 
tained in the communication. 

5. When fear is strongly aroused but is not 
fully relieved by the reassurances contained 
in a mass communicz‘tion, the audience will 
become motivated to ignore or to minimize 
the importance of the threat. 
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He followers of the “New Look”?* in 
perception theory stress the role of the 
perceiver. Although it has long been 
recognized that the perceiver does indeed 
make a significant contribution to the percept, 
these experimenters have demonstrated the 
relationship between the psychological organi- 
zation of the perceiver and certain character- 
istics of the percept. The less well defined 
the stimulus situation, or the more emotion- 
ally laden, the greater will be the contribution 
of the perceiver. This principle is, in fact, 
the basis of the projective techniques. 

The perception of people, or more specifi- 
cally, the judgment of personality characteris- 
tics of people from their physical appearance, 
would seem to be an area in which the 
observer makes a great contribution to ‘the 
percept. It has often been demonstrated that 
judgments of this type possess little validity.” 
Razran (20) has shown that the social stereo- 
types of the subject influence these percep- 
tions, and Allport and Kramer (1) have 
shown that social attitudes may sensitize an 
individual to various cues in the stimulus 
field and may even increase the validity of 
certain judgments. One projective tech- 
nique, the Szondi, is based on a supposed 
unconscious identification with the persons 
photographed (8). 

Despite the promise of this area of investi- 
gation, and despite its importance in every- 
day behavior, relatively few investigations 
have been made. It has been demonstrated, 
however, that a subject’s value system may 
influence his perception of (other) ambiguous 
situations (17). Would the value system also 
influence the perception of people? Would 

1This term was first used by Krech and Crutch- 
field (10) and later by Luchins (13) and by Bruner (5). 
It is a convenient method of describing the personality 
oriented approach to perception that is perhaps best char- 
acterized in the chapters by Bruner, Frenkel-Brunswik 
and Klein in the book edited by Blake and Ramsey (4) 


2 A brief review of the literature on this subject may 
be found in Brunswik (6, p. 180). 


this influence extend to all judgments of the 
stimuli? The purpose of this study is to in- 
vestigate the answers to these questions. 

It is believed that an individual’s value 
system may influence his judgment of people 
in two major ways: that the values may serve 
as a frame of reference for the judgments 
and, under certain conditions, may be pro- 
jected into the stimulus-figure. Two specific 
hypotheses will be tested in this experiment. 

Hypothesis A: Each subject (S) will use 
his own high values as anchoring points for 
judging or describing the personality of 
another person based on his physical appear- 
ance. This hypothesis is suggested by the 
studies of Sherif (22) which indicate that an 
interaal frame of reference is used in a situ- 
ation where the S has few objective cues. It 
is also suggested by such studies as that of 
Tresselt and Volkmann (26) which show 
that the past experience of an individual may 
serve as a frame of reference in such psycho- 
physical tasks as the judgments of weights. 
In common experience, the same hypothesis 
seems to be plausible. Persons interested in 
commercial success often judge college pro- 
fessors in terms of their yearly salaries. Aca- 
demic psychologists often judge clinicians in 
terms of their research productivity. 

Hypothesis B: The observers’ major values 
will be projected in a discriminative fashion 
so that a general “halo” effect may be dis- 
cernible. Thus it would be predicted that the 
assignment of values in which the S himself 
is high will be associated with a feeling of 
liking the stimulus picture. This prediction 
follows from Murray's observation that “The 
complement of empathy is projection. We 
feel something (by empathy) and we imagine 
that the other person feels the same (projec- 
tiun}” (16, p. 247). Indeed, in the Thematic 
Apperception Test, S identifies with the 
figure and projects into these figures his own 
emotional characteristics. 
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MetTHop 


Apparatus and Procedure 


The stimuli for this experiment consisted of 24 
photographs, 12 of men and 12 of women. Orig- 
inally 55 pictures of women and 66 pictures of men 
were selected from back issues of Time magazine. 
The 24 finally selected were those which a group of 
8 judges chose as representing the widest distribu- 
tion of impressions in physical appearance, age and 
possible personality characteristics. These stimuii 
pictures were 2.5X3.5 in. and were mounted on 
plain white paper 4X5 in. Each photograph was a 
portrait type showing only the head and shoulders 
of the person. All of the stimulus persons were 
conventionally dressed and evidenced no primary 
cues to suggest social, ethnic, or occupational group- 
ings. In actuality they represented a range of 
notoriety varying from murderers to college 
presidents. 


Subjects 

Ss of the experiment were 28 male college 
students ranging in age from 19 to 53 years. All 
were attending college at the time of this experi- 
ment. These Ss were tested individually on four 
tasks: 

Task A: A seven-step, closed-end, like-dislike 
rating scale was devised. The Ss were to rate each 
stimulus picture on this scale. To check on the 
possibility that these ratings might have been 
unduly influenced by the apparent age of the 
stimulus person, two scales were used. One scale 
called for like-dislike as a friend, the other as a 
neighbor. Each S, therefore, actually made two 
ratings of each picture. 

Task B: The Allport-Vernon Study of Values (2) 
was administered to S. 

Task C: A set of six statements was presented to 
S. Each of these statements described one of the 
values of the Allport-Vernon and was adapted from 
the manual for that test. Thus the theoretical 
value became: “Chief aim in life is to advance 
knowledge through scholarship or research”; the 
political value read: “Interested mostly in personal 
power, influence and renown,” etc. S was to select 
the statement which “best” described each stimulus 
photograph and the statement which “least” de- 
scribed each. 

Task D: Each S was asked to rank the six 
statements used in Task C as they applied to him- 
self. He was also asked to rate himself on the like- 
dislike scales, ic., to rate himself on how other 
people would react to him as a friend or neighbor. 

In order to prevent any constant transfer from 
one task to the next, Ss were divided into four 
groups. Each group was presented the above tasks 
in different orders. A-B-C-D, B-D-A-C, D-C-B-A, 
and C-A-D-B were the orders used. No S reported 
more than a slight amount of insight into the pur- 
pose of the experiment. 


REsuLts 


The ratings on the two different (for 
friend, for neighbor) like-dislike scales were 


compared for each S. Only three Ss showed 
ratings which differed significantly from one 
scale to the other. Since the purpose of using 
two scales was to control the possible influ- 
ence of the apparent age of the stimulus pic- 
ture upon the like-dislike judgments, it was 
concluded that this influence was minimal. 
The results presented in the tables, therefore, 
are based only on the ratings on the “friend” 
scale. 


Hypothesis A. 


Rank-difference correlation coefficients were 
calculated to compare the rankings of each 
S’s values according to the Allport-Vernon 
and according to the S’s self-ranking. The 
median coefhcient of correlation was .66. 


TABLE 1 


Mean Frequency or Use or Susyect’s Hicuest 
Two Vatues anp Lowest Two Vatues as 
DETERMINED BY THE ALLPORT-VERNON 








Least 


L 


Best Tora 











* Difference significant at the .o5 level of confidence. 
** Difference significant at .o1 level of confidence. 


However, about one-third of the group dif- 
fered significantly from a chance correlation? 
The results of both the self-rating and All- 
port-Vernon scores for determining the S’s 
two highest and two lowest values are pre- 
sented separately. 

A count was made of the number of times 
each of the above values was used to describe 
the stimuli and the group means were deter- 
mined. The results are shown in Tables 1 
and 2. These tables show not only the aver- 
age number of times the values were used, 
but also the average number of times they 
were used as “best” and as “least” in describ- 
ing the photographs. The results for pictures 
of women and pictures of men are presented 
separately. Examination of Table 1 and 
Table 2 shows that, in all but one instance, 

8 Interpolating from the table for testing the signifi- 
cance of rho prepared by DuMas(9, p. 435), an 


obtained rho of .82 was considered to differ significantly 
from a chance relationship. 
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TABLE 2 


Mean Frequincy oF Use or Susyect’s HicHest 
Two Vatues anp Lowest Two VALUES as 
DETERMINED BY SELF-RANKINGS 








Best Least Tora 


ihe * H L H 





STImMuLus 





Men 6.9 
Women 7.6 
Total 4-6 


3:7 4 
3-4 4: 
9.3 8. 


ad ° ‘ I 





* Difference significant at the .o1 level of confidence. 


low values were more frequently used to 
describe the stimuli than were high values. 
Those differences that are statistically re- 
liable are indicated. 

The use of middle values falls halfway 
between the frequency of use of the high 
values and the low values. Thus, contrary 
to Hypothesis A, it must be concluded that 
under the conditions of this experiment the 
lower the value was, the more frequently it 
was used to describe the photographs. 


Hypothesis B 


The stimulus pictures were then divided 
into groups based on the values assigned to 
them by each S. All the pictures which were 
given S’s highest values as best describing 
them were placed in one group, those attrib- 
uted the lowest values as best describing them 
were placed in another group.* A similar 
procedure was followed for the values con- 
sidered least descriptive of the stimuli. The 
high and low values as determined by the 

4 By high values and low values, the two values with 


the highest ranks and the two values with the lowest 
ranks are indicated. 


Allport-Vernon and as determined by the 
self-rankings were treated separately. 

The like-dislike ratings for the pictures in 
each of the above groups were noted and 
average ratings were compiled for each S. 
These ratings in turn were used to produce a 
mean like-dislike rating for each group. The 
means are shown in Table 3. On the rating 
scale used, the most liked rating received a 
numerical score of 1 and the most disliked 
rating a numerical score of 7. Therefore, 
the lower the mean, the better the degree of 
liking. Conversely, the higher the mean, the 
more dislike is indicated. The midpoint of 
the scale was 4.00. 

Examination of Table 3 shows that when 
high values were assigned as “best” describ- 
ing the pictures, these pictures were better 
liked, i.e., had lower means, than when low 
values were assigned. Where high values 
were assigned as being “least” descriptive of 
the pictures, those pictures tended to be dis- 
liked. Almost all of the obtained differences 
were statistically significant. 

Although also true of the previously pre- 
sented data, Table 3 shows most clearly that, 
although the results obtained through the use 
of both techniques of ranking values were 
parallel, the self-ranking method seems to be 
the more sensitive. The differences obtained 
with this method are larger and have higher 
statistical significance than the differences ob- 
tained when the values were ranked accord- 
ing to the scores on the Allport-Vernon. The 
most probable explanation of these results is 
in terms of the relative validity of the two 
techniques. However, another suggestion is 
that the significant variable is not the actual 
value structure but S’s concept of it. The 


TABLE 3 


Mean Ratincs on Lixe-Distixe ScaLte For Puotocrapus AssicNep Susyects’ Hicnest Two Vatues anp 
Lowest Two Vatues ¢ 








ALLPorT- VERNON Seir-RanKING 








H L 





3.92%° 
4.16 
4-11° 


Men 
Women 
Total 


4.00 
4-17 
4-22 


3-27 
3-74 
3-56 





+ Low ratings are most liked; high ratings, most disliked. 
® Difference significant at the .05 level of confidence. 
®® Difference significant at the .o1 level of confidence. 
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data of this experiment, however, furnish no 
specific evidence on this point. 

The data also indicate that the phenomena 
noted apply more to the judgments of pic- 
tures of men than to the judgments of 
pictures of women. Since all Ss in this 
experiment were men, it is inconclusive as to 
whether a social or cultural stereotype is 
involved or whether a “same-sex—different- 
sex” variable is the important element. Fur- 
ther studies are contemplated to examine this 
aspect of the problem. 

The Ss of the experiment also rated them- 
selves on the like-dislike scales. The great 
majority of them rated themselves as mildly 
liked, a few Ss rated themselves as greatly 
liked, and a few as neither liked nor disliked. 
No other points on the rating scales were 
used. 


Discussion 


The evidence that Ss of this experiment 
tended to use their lowest values most fre- 
quently in describing the stimuli leads to a 
rejection of the first hypothesis formulated. 
This hypothesis predicted that the S’s high 
values would be used as the anchoring points 
in the judgments of the photographs. The S$ 
actually utilizes his high and low values as 
an interiorized frame of reference, but in this 
experiment the low values were more often 
used as anchoring points than were the high 
values. 

Whether or not S uses his high or low 
values as points of reference seems to be a 
result of several influences. One influence 
may be mental set or readiness. Studies by 
Postman, Bruner, and McGinnies (17) and 
by McGinnies (15) have shown that in the 
identification of words or in the response to 
words there is a readiness or a set on the part 
of Ss to respond in terms of their dominant 
values. Even guesses were not haphazard 
but reflected Ss’ high values. Their studies 
lead to the conclusion that “As frequently as 
possible and as long as possible perceptual 
guesses are made in congruence with prevail- 
ing value orientation” (17, p. 153). Although 
the judgments made in the present study may 
be considered in the nature of “perceptual 
guesses” this formulation does not apply. 
The very opposite is found. 

The difference in the results obtained in the 
present investigation and those cited above 


may be due to attitudes or stereotypes. All- 
port and Kramer (1) report a study dealing 
with the perception of ‘photographs. Their 
experiment called for S’s identification of 
minority group members from a larger group 
of photographs. The relevant conclusion is 
that the Ss with greater prejudice identified 
more pictures as members of the minority 
group than did Ss with lesser prejudice. 
Although Carter (7) questions the degree of 
relationship, his own results in a similar situ- 
ation verify the fact that there is such a rela- 
tionship. Lindzey and Rogolsky (11) also 
corroborate this finding.® This phenomenon 
is explained by stating that the consequences 
of failing to recognize a minority group 
member are greater for the prejudiced than 
for the nonprejudiced. Such recognition is 
a reinforced activity among the prejudiced 
but not among the nonprejudiced. 

This interpretation of the effect of past 
experience may be used to account for the 
present results. Tresselt (25) has shown that 
having had some experience in judging a 
limited range of weights, and then being 
given an expanded range of weights for judg- 
ment, will cause a shift of the judgmental 
values. A speculative explanation in this ex- 
periment therefore may be ventured in terms 
of the “strangeness” or psychological distance 
of the photographs being judged. This ex- 
planation postulates that the entire value 
system of the individual forms a frame of 
reference and so is very similar to the “light- 
medium-heavy” frame of reference used in 
the judgments of weights. Each value occu- 
pies a definite and perhaps equal area of the 
continuum. Because of the “strangeness” of 
the stimulus pictures, the adequacy of the 
familiar frame of reference was found lack- 
ing. The frame, therefore, had to be ex- 
panded to include these new stimuli and was 
expanded at the low end. This inclusion 
caused all values to expand in area. How- 
ever, as the stretching or expansion was at the 
low end of the scale, the low values expanded 
most in area,and the high values expanded 
least. The low values now covered the largest 
area of the continuum and hence were most 
frequently used. It is probable that further 
use of the expanded scale would make for a 

5 Although Marks (14) has criticized some of the 


interpretations of Allport and Kramer, he makes no 
mention of these particular data. 
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reorganization of the categories so that each 
value occupied approximately equal areas. 

The frame of reference could also be ex- 
panded in the direction of the high values. 
It may be speculated that this is what hap- 
pened in the previously cited study by 
Postman, Bruner, and McGinnies. Some- 
thing in the experimental conditions, perhaps 
the use of words as stimuli or the prestige of 
the experimenters, may account for the ex- 
pansion and may account for the increased 
readiness to respond to high values and to 
use these values in guesses. 

Solomon and Howes (23) and Postman 
and Schneider (18) have shown experiment- 
ally that the familiarity of the stimulus word 
is an important variable in determining dura- 
tion thresholds. Both experiments show that 
when the frequently seen words (as deter- 
mined by the Thorndike-Lorge Word List) 
were used as stimuli, the value systems of Ss 
played no role in determining the duration 
threshold. When less frequently seen words 
were the stimuli, the values of Ss did influ- 
ence these thresholds. 

Again it may be speculated that when S$ 
responded to the relatively frequent words, 
his frame of reference was “at rest,” i.e., in the 
normal state. Each value occupied approxi- 
mately the same area of the continuum. The 
readiness to respond under these conditions 
would be the same for all values. The rela- 
tively infrequent word stimuli required a 
longer recognition and, to a highly motivated 
S, this might have contributed to a mild 
stress. The Ss would have to expand their 
frames of reference, in this instance in the 
high direction, to include the previously less 
frequently used words in order to establish 
duration thresholds equal to the thresholds 
for frequently used words. 

In the present study, the inadequacy of the 
anchoring points ordinarily used may not be 
due to the difficulty of making the judgments 
as much as to the strangeness of the photo- 
graphs. The extension of the scale may be 
due to the Ss’ recognition of the difference 
between themselves and the stimulus group 
and may be their way of saying, “We are 
different.” 

A somewhat similar rationale has been 
advanced by Thompson (24) as a basis for 
modifying the TAT to include the use of 
Negro figures. He believed that the use of 


Negro figures as stimuli in the cards pro- 
duced a greater empathy among Negro Ss 
than did Murray’s version of the test which 
used no such figures. Although Riess, 
Schwartz, and Cottingham (21) have dis- 
proved this assumption, their objection does 
not appear to be the use of Negro figures 
per se but to the alien settings in which they 
were displayed. In the present study, the 
figures were not set in any social context and 
their physical appearance alone may have 
marked them as different. 

This speculative explanation should be rela- 
tively easy to test. Should the present study 
be repeated using photographs from college 
yearbooks, there should be an increased use 
of high values by the Ss. Should the experi- 
ment on the identification of minority group 
members use photographs that will appear 
“stranger” than those hitherto used, there 
should be a corresponding increase in the fre- 
quency with which such membership is 
attributed to the stimuli by the prejudiced S. 
Indeed, the surprisingly high correct identi- 
fication of psychopathology from among the 
Szondi pictures (19) may be due to the very 
strangeness of these pictures. 

The “halo” effect predicted by the second 
hypothesis formulated in the introduction to 
this paper is clearly shown by the data. The 
closer the value system projected into the 
stimuli resembled the value system of a sub- 
ject, the greater was the liking. The less the 
value system projected into a stimulus photo- 
graph resembled that of the S, the greater was 
that photograph disliked. 

No cause and effect relationship is neces- 
sarily involved in this association of variables. 
It does corroborate the finding of the “halo” 
effect in judgments of photographs by Bruns- 
wik (6). In the latter study, however, the 
high and low ratings were all explicit in the 
experimental situation. A high rating in 
intelligence or in leadership is generally 
considered desirable. In the present study, 
one of the variables was implicit in the Ss. 
Their individual value systems determined 
whether a certain description was high or 
low. Perhaps if Brunswik had considered 
the value placed upon leadership or intelli- 
gence by his Ss, his indices of the “halo” 
effect might have been even greater. The 
variables upon which the photographs in his 
experiment were rated, however, had strong 
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cultural values to which the individual values 
probably conformed. In an experiment 
dealing with other variables, or in an experi- 
ment dealing with less culturally oriented 
Ss, e.g. psychotics, this effect may be of 
major importance. 

The association between the two variables 
of this experiment may come from an over- 
all impression of the personality of the stim- 
ulus that had been formed by the Ss. Asch 
(3) and Luchins (12) suggest that such 
over-all impressions are formed. The like- 
dislike rating, then, may be an index of the 
amount of identification existing between 
S and the stimulus. 


SUMMARY AND CONCLUSIONS 


1. The purpose of this experiment was to 
study the influence of an individual’s value 
system upon his perception of people. Two 
hypotheses were tested. (@) An individual's 
high values will serve as anchoring points for 
his judgments of people. (4) A “halo” effect 
will be found where the nearer the stimulus 
was judged as resembling S in terms of 
values, the more that stimulus will be liked. 

2. The 28 Ss of this experiment made judg- 
ments of each of 24 portrait-type photographs. 
They rated each photograph on like-dislike 
scales and also indicated the value they be- 
lieved would ‘best describe the person pictured 
and that value which would least describe 
him. Each S also had his own value sys- 
tem determined through the Allport-Vernon 
Study of Values and through a self-ranking 
technique. 

3. It was found that Ss most often used 
their own low values to describe the stimuli. 
This result is interpreted as rejecting the first 
hypothesis proposed. An explanation is sug- 
gested in terms of the “strangeness” of the 
stimuli. 

4. The results support the second hypothe- 
sis concerning the “halo” effect. Photographs 
that were most liked were attributed values 
that most closely resembled those of the S. 
This relationship may be attributed to the 
fact that both may stem from an over-all im- 
pression of personality formed by the S. 
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stated is that there is a tendency for cer- 

tain individuals to have repeated acci- 
dents. This was postulated to account for 
the early discovery that sample populations 
of adult workers included individuals who 
had more accidents than could be accounted 
for by chance alone. More recent findings 
appear to minimize the importance of the 
statistical evidence for the accident proneness 
hypothesis (3). 

If we are willing to accept Cobb’s (3) 
assignment of the variance in accident dis- 
tributions to both chance and differential 
liability factors, however, it follows that cer- 
tain individuals might be included among the 
high accident group by chance alone, while 
others are included because of differential 
liability. ‘There are indications in the clinical 
literature to suggest that there are individuals 
who by virtue of special personal characteris- 
tics continue to have repeated accidents (4, 5, 
6, 13). It is this writer’s suggestion that these 
are the people whom it is necessary to isolate 
for adequate validation of the accident prone- 
ness hypothesis as a clinical concept. King 
has pointed out: 


T= accident proneness hypothesis simply 


The test of the hypothesis lies in the success or 
failure to identify causal physiological or psycho- 
logical characteristics, or their combination, present 
in individuals showing a high frequency of acci- 
dents, and absent or present to a lesser degree in 
“accident free” individuals with a comparable back- 
ground of experience (9, p. 374). 


The present study was an attempt to vali- 
date the accident proneness hypothesis as a 


1 This paper is a portion of a di.sertation submitted 
to the Department of Psychology of the University of 
Rochester in partial fulfillment of the requirements for 
the Ph.D. degree. The writer is indebted to Professor 
G. R. Wendt of the Department of Psychology for advice 
and assistance throughout the investigation, and to the 
students, teachers, and principals of the schools in Roch- 
ester and Brighton who cooperated to make this study 
possible. She is also indebted to the Strong Memorial 
and Municipal Hospital, the Genesee Hospital, the 
Rochester General Hospital, and the Highland Hospital 
for the privilege of using their records to select the acci- 
dent subjects. 


clinical concept. It was felt that an investi- 
gation of personality characteristics which 
predispose to repeated accidents in children 
might throw light on the reasons for child- 
hood accidents as well as on the concept of 
accident proneness as a whole. If character- 
istics could be identified in childhood which 
are similar to those found for adults, this 
would add further validation to the clinical 
concept of accident proneness and might sug- 
gest that accident liability is continuous from 
childhood to maturity. 


THEORETICAL FoRMULATION 


The hypotheses which were tested in this 
experiment were derived from the theoretical, 
clinical, and experimental literature (1, 4, 5, 
6, 7, 13). The focal agreement of this litera- 
ture appears to be that the accident prone 
individual grows up in an authoritarian, strict 
home environment, in which feelings of rejec- 
tion and resulting hostility are fostered. The 
accident is said to occur as a result of inward 
turning of aggression. 


Frustration-Induced Aggression 


The source of the accident prone child’s 
aggression is believed to be frustration of 
nurturance. It is assumed that all children 
have some drive toward nurturance, and that 
in the accident prone child, brought up in an 
authoritarian, rejecting home environment, 
this drive has been frustrated. From this 
emerges the first hypothesis: (@) that the 
accident prone child has more frustration- 
induced aggressive drive than the accident 
free child. We should expect that aggressive 
drive will be expressed in aggressive behavior 
in a permissive, nonpunitive situation. 


Aggression Anxiety 


The accident prone child is believed to fear 
the loss of parental love, a form of punish- 
ment, as his hostility is assumed to be dirécted 
against parental figures. If it is also assumed 
that he is punished for direct and overt ex- 
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pression of aggression, the second hypothesis 
is indicated: (4) that the accident prone child 
has more guilt and fear of punishment at the 
overt and direct expression of frustration- 
induced aggressive drive. (This avoidance 
drive at the expression of aggression is called 
aggression anxiety in this paper.) To support 
this hypothesis aggressive behavior should be 
inhibited and delayed, and its form should 
show distortion. It is possible to test this 
hypothesis in another way, if we equate 
aggressive drive with the approach gradient, 
and aggression anxiety with the avoidance 
gradient (2, 14, 15). On the basis of the find- 
ing that the avoidance gradient is steeper than 
the approach gradient, it would be expected 
that when aggression anxiety is extinguished 
in a permissive situation the accident repeat- 
ing child will show a greater increase in 
aggressive behavior and a greater decrease in 


measures of aggression anxiety than the acci- 
dent free child. 


Family Atmosphere 


While the relationships between aggression 
and aggression anxiety are the central fac- 
tors to be studied in this investigation, the 
hypotheses concerning them are based on 
the antecedent assumptions concerning the 
authoritarian home environment and frustra- 
tion of the nurturance drive. On the assump- 
tion that the accident prone child comes from 
a highly controlled environment, which is 
also one of limited affectionate contacts be- 
tween adults and children, we may state the 
third hypothesis: (¢) this atmosphere of 
dominance and command, and limited affec- 
tionate contacts is reflected in the accident 
prone child’s perception of reality. We may, 
therefore, expect that when a permissive situ- 
ation is structured on the basis of this experi- 
ence, there will be a high proportion of 
commands, threats, and prohibitions, a low 
proportion of affection, and a high proportion 
of attempts to solicit affection. 


METHOD 


The choice of method was determined by the 
need to measure both aggressive drive and aggres- 
sion anxiety. We have proposed for the accident 
repeating child, however, that fear and guilt will 
prevent the free expression of aggression, under the 
assumption of algebraic summation of the approach 
and avoidance drives. The solution to this apparent 
dilemma appeared to be the use of a permissive, 


relatively unstructured situation, a standardized 
doll play interview. Miller (16) has pointed out 
that in order to detect the instigation to aggression 
in cases where even the less overt acts of aggression 
have been inhibited, it is necessary to reduce the 
competing instigations (or the aggression anxiety, 
in this case). It was hoped that in the doll play 
situation, where expression of aggression is not 
punished, some extinction of aggression anxiety 
would take place and the manifestation of aggres- 
sive behavior, as well as other behavior, might be 
studied. 


Materials 


Two 20-minute standardized dol! play interviews 
contributed the major results of this study. Since 
it was desirable to limit organizational activity and 
promote thematic material, particularly of an 
aggressive nature, a doll house of high realism and 
maximum organization was designed and con- 
structed (17, 18). It is a portable, folding doll 
house whose dimensions are 24 in. wide, 37 in. long, 
and 4 in. deep when opened in experimental posi- 
tion. It consists of six rooms, in which most of the 
furniture is glued to the floor: The fantasy support 
is that of the home environment, and the doll 
family was a standard one (father, mother, boy, 
girl, and baby). The dolls used were the Flagg 
Flexible Dolls, which are made of a flexible, plastic 
material, easily manipulated by children. 


Instructions 


The experimenter (E) gave the following direc- 
tions to the children: 

This is a little six-room house. Most of the furniture 
is glued down and so you can’t move it. This is the 
furniture that you can move. (Point out kitchen chairs, 
high chair, dining room chairs, piano stool, crib, and 
dressing table chair.) Here is the kitchen, dining room, 
the living room, the children’s bedroom, the bathroom, 
and the father and mother’s bedroom. You may use 
the dolls and the house in any way that you want to. 
The dolls can be bent into any position, and can be 
moved around. (Demonstrate flexibility of the father 
doll.) 


Opportunity was given at this time for the subject 
(S) to ask about the construction and manipulation 
of pieces. Then E stated: 

Now what we want you to do is to make up a story 
or a play about this house. It can be any kind of a 
story you can think of. 


Procedure 


Following the instructions, E recorded the child’s 
play behavior and verbalizations on ruled and 
squared off sheets of paper, one square being used 
for each 15-second time unit. An electric clock 
motor with a revolving cam made a contact every 
15 seconds to produce an auditory signal, which 
indicated to E when to move on to a new time unit 
on the recording sheet. (The E’s role was mini- 
mal.) Simultaneous recordings of verbalizations 
were made on a standard wire recorder, which was 
screened from the child’s view. The play behavior 
was later synchronized with the wire-recorded 
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material with the aid of the auditory signal. These 
synchronizations represented the complete verbatim 
transcriptions of the play sessions, which could 
then be scored. 


Doll Play Measures 


The scoring system which was developed to 
measure doll play behavior was based on the scoring 
categories developed in the Iowa Studies (17, 18, 
19) and on the findings of Levy (11, 12). Those 
scoring categories yielding the results reported in 
the article are as follows: 

Aggression. This category, designed to test the 
hypothesis concerning aggressive drive, included 
doll and subject attitudes, feelings, intentions, and 
actions of an injurious, critical, or destructive nature. 

Aggression Anxiety. a. Form of aggression. The 
subcategories of verbal aggression, verbal description 
of aggression and action aggression were separately 
scored to reveal differences in the amount of 
inhibition. 

5. Latency of aggression. The delay in express- 
ing aggression was measured by the number of time 
units preceding the expression of the first aggressive 
act in doll play. 

c. Inhibitions to aggression. This category in- 
cluded subject and doll denials of aggression, self- 
warnings against committing acts of aggression, 
requests for reassurance, and two subcategories of 
tangential activity. Tangential play was scored 
when S's doll play did not appear to be a story with 
“living character roles”; tangential behavior, when 
S asked about, played with, or explored material in 
the experimental room other than the doll house 
and the dolls themselves. 

Family Atmosphere. a. Authoritarian atmos- 
phere. Commands, prohibitions, and threats were 
scored in this category. 

4. Affection. Doll and subject attitudes, feelings, 
intentions, and actions of an affectionate, friendly, 
generous nature, or happy mood in general, were 
scored in this category. 

c. Solicitance of affection. Requests for help or 
affection by either subjects or dolls were scored in 
this category. 

Additional. a. Stereotyped thematic play. This 
included thematic action or phantasy characteristic 
of routines carried on in the usual home situation, 
appropriate to the time, place, situation, and char- 
acters involved. 

b. Nonstereotyped thematic play. All thematic 
action or verbalized phantasy concerning situations 
outside of the usual routines and not classified else- 
where were scored in this category. 

c. Experimenter interaction, This category was 
defined as E’s requests and replies to S. 

d. Activity level. This was measured by cumu- 
lating the total amount of doll play behavior. 

The method of scoring adopted was one of com- 
plete coverage. The scores were made in terms of 
behavior units. A behavior unit was determined 
by the completion of a goal-end action, and goal-end 


2The complete scoring system may be found in 
Appendix E of the unpublished doctoral dissertation (10). 
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actions were defined in terms of relevancy to the 
above scoring schema. All identifying data were 
removed from doll play protocols before scoring. 


Scoring Reliability 


Agreement between scorers. An independent 
scoring was made by an experienced clinician of 
five play-session protocols chosen at random. Relia- 
bility was defined in terms of the number of agree- 
ments and number of disagreements found between 
E and the independent scorer. An agreement was 
noted when the exact scoring subcategory was indi- 
cated for a time interval. Disagreements due to 
confusion with another subcategory were counted 
one-half against the first subcategory and one-half 
against the second one. The formula for computing 
reliability was, therefore: 

Number of agreements between observers A and B 
Number of agreements plus one-half the disagreements 





The agreement for the totals of each of the cate- 
gories ran from 43.6 per cent to 95.3 per cent, and 
the mean reliability was 68.2 per cent. 

Self agreement. The experimenter rescored four 
play sessions selected at random, more than six 
months after the original scoring. The agreement 
for the totals of each of the categories ran from 
60.5 per cent to 95.6 per cent, and the mean relia- 
bility was 80.7 per cent. These percentages of 
agreement compare very favorably with those re- 
ported in the doll play literature, in view of the 
fact that they are computed on a small sample 
according to a rigorous criterion of agreement. 


Subjects 


The accident repeating children studied in this 
investigation were located through the inpatient and 
outpatient records of five major hospitals in the 
City of Rochester, New York.? Only those five- to 
eight-year-old children who had had a record of 
three or more major or minor accidents in the four- 
year period prior to study (January, 1946, to Jan- 
uary, 1950) were selected for study. Children hav- 
ing epilepsy, mental deficiency, and organic brain 
disease were omitted. The accidents of the remain- 
ing eligible children were carefully screened to 
ascertain that they were not due to the carelessness 
of parents or other adults. Of the 2,649 children 
found to have accident records, only 53 children 
met the criteria. Twenty-one of these children 
were omitted for various reasons. No apparent 
selection factor which might differentiate these 21 
children from the other 32 was discovered. 

The 32 accident Ss who were included ir this 
study ranged in chronological age from § years, 
8 months to 8 years, 10 months, with a mean age 
of 7 years, 2 months. Their mean mental age 
equivalent on the Terman Vocabulary Test was 7-2, 
with a range from 4-0 to 10-9. These 32 children 


8 An analysis in terms of the kinds of accidents occur- 
ring to the five- to cight-year-old children brought to 
five major hospitals of the City of Rochester, New York, 
between 1946 and 1950 will be reported in a later 
publication. 
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had had a total of 110 accidents during the period 
surveyed, an average of 3.4 accidents per child, with 
a range of from 3 to 6 accidents. Falls (in terms 
of event) and lacerations (in terms of injury) com- 
prised the greatest number of accidents. These Ss 
are, in general, a minor injury population. 

The 32 control children were paired with the 
experimental children on the basis of sex, chrono- 
logical age, and menta! age equivalent scores on the 
Terman Vocabulary Test, and were selected from 
the same grades and schools as the experimental! 
children. The control children were an accident 
free group, determined by checking school health 
records, public health nursing records, and the sur- 
vey records of this study. The control Ss ranged 
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ardized doll play interview. In all cases, the control 
child was seen the same number of times as the 
experimental child. In no case was there Jess than 
24 hours between the Rosenzweig Picture-Frustra- 
tion Study and the first doll play session, or between 
the first and second doll play session, and there was 
never more than a week between doll play sessions. 


REsuLtTs AND Discussion 


Aggressive Drive 


Table 1 presents the totals for doll play 
measures designed to test the hypotheses re- 
lating to aggressive drive and aggression 


TABLE | 


Dott Pray Measures or Accression AND AccRESSION ANXIETY FoR Two Twenty-MINUTE STANDARDIZED 
Dott Pray INTERVIEWS 


(N=32 pairs of children) 








AccIDENT 
FREE 


AcCIDENT 
REPEATERS 


Dout Pray 
MEAsURES 


SIGMA 


ENCE 
Dirren Mean Dirr. 





Aggression 
Proportions 
Time Units 


Aggression Anxiety 
Form of Aggression**® 
Verbal 
V. Action -183 
Total Verbal 
Action 


Latency 78.06 59 


Inhibitions 
Proportions .271 
Time Units 59.00 


.298 
64.63 





* p values are for only one tail of the distribution. 
** Computed as a proportion of total aggression. 


in chronological age from 5 years, 8 months to 
8 years, 9 months, with a mean age of 7 years, 1 
month. Their mean mental age equivalent was 
7-4, with a range from 4-6 to 11-3. 


Experimental Design 


The children were seen in 14 public schools and 
7 parochial schools in the city of Rochester, and in 
2 schools in Brighton. Arrangements to see the 
children were made with the principals in order to 
keep the nature of the investigation confidential 
from the teachers and children. The control and 
experimental children were first paired by means of 
the Terman Vocabulary Test. Subsequent to this 
the Rosenzweig Picture-Frustration Study, Chil- 
dren’s Form, was administered.4 The next two 
sessions included sessions one and two of the stand- 


*The results obtained with the Rosenzweig Picture- 
Frustration Study, Children’s Form, will not be reported 
since they showed no significant differences which could 
not have been obtained by chance alone. 


anxiety.” The measures were scored both as 
percentages of total doll play behavior and in 
terms of the number of time units in which 
the behavior occurred. The results indicate 
that the accident repeating children studied 
in this investigation demonstrated more 
aggressive behavior in doll play than the acci- 
dent free children. These findings are in the 
direction of confirming the hypothesis con- 
cerning aggressive drive as stated above. 


Aggression Anxtety 


The results testing the aggression anxiety 
hypothesis show some inconsistencies. The 
accident repeating children demonstrated 

5 The data for each doll play interview and for boys 


and girls separately are given in detail in the unpub- 
lished doctoral dissertation (10). 
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significantly more of their aggression as 
verbal aggression and tended to show a 
smaller percentage of their aggression as 
action aggression than the accident free chil- 
dren. These results suggest the attenuation 
of the intensity of aggression, possibly as a 
result of aggression anxicty, in the expected 
direction. However, the accident free chil- 
dren showed more inhibitions and signifi- 
cantly longer latency than the accident 
repeating children. This was contrary to 
prediction, as these were considered to be 
measures of avoidance of the expression of 
aggression. 
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expect that with extinction of aggression 
anxiety in a permissive situation the verbal 
forms of aggression (less intense forms) 
would decrease more for the accident re- 
peaters than for the accident free, while the 
action forms (more intense forms) would 
show greater increases for the former. While 
the trend for verbal aggression alone did not 
follow this hypothesis, those for verbally 
mediated action aggression, total verbal ag- 
gression, and action aggression are in the 
expected directions. It is probable that sig- 
nificant differences for these changes were 
not found because two sessions were not 


TABLE 2 


Cuances 1n Dott Pray Measures FROM SESSION ONE To Session Two 


(N=32 pairs of children) 








Mean CHaNnce**® 





Dott Pray 


MEASURES ACCIDENT 


FREE 


ACCIDENT 
REPEATERS 


DIFFERENCE p* 





Aggression 
Proportions 
Time Units -75 


Latency 56 


Form of Aggression 
Verbal 
V. Action 
Total Verbal 
Action 


-005 
-007 
-032 
—.119 


044 


-045 
.034 


‘ -94 +20 
-09 ° -61 +30 


+59 +59 +30 
.008 ° .08 -50 
-O051 ‘ -88 +20 


.077 : -67 +30 
.085 a 83 -25 





* p values are for one tail of the distribution. 


** A mean change from session one to session two is an increase when the sign is negative and a decrease 


when the sign is positive. 


Table 2 presents the results for changes 
from session one to session two, a comparison 
designed as a further test of the aggression 


anxiety hypothesis. It was predicted that 
aggression anxiety would be extinguished in 
a permissive situation more quickly for the 
accident repeaters than for the accident free. 
Although no significant changes were found, 
they were all in the predicted direction. The 
accident repeating children demonstrated a 
greater increase in aggressive behavior from 
session one to session two than did the acci- 
dent free children. The accident repeating 
children also showed a greater decrease in 
latency from session one to session two than 
did the accident free children. For form of 
aggression to be in the direction of supporting 
the aggression anxiety hypothesis, we should 


enough to extinguish aggression anxiety 
completely. 


Aggression Anxiety Inconsistencies 


The limited and inconsistent support of 
the aggression anxiety hypothesis may be 
explained in severa! ways. It is possible that 
inhibitions and latency are not “pure” meas- 
ures of aggression. Other research (17, 21, 
22) suggests that tangentiality (inhibitions) 
and latency (delay) are correlated with ag- 
gressive drive, as well with aggression anxiety. 
An alternative explanation may be that the 
aggression anxiety hypothesis was over- 
weighted in the formulation of predictions 
for this study. This might have occurred by 
basing predictions on information in the lit- 
erature concerning the personality dynamics 
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of adult accident Ss, because of the limited 
number of publications on children with 
repeated accidents. Developmental differ- 
ences may, therefore, account for the differ- 
ence in findings. One would not expect 
well-developed superego inhibitions in chil- 
dren of this age, considered to be the major 
dynamic basis for inward turning of eggres- 
sion in the adult accident prone individual. 
The differences found may also be explained 
on the basis of differential developmental 
ability to discriminate reality restrictions, i.c., 
we may expect experience differences between 
children and adults in their ability to evaluate 
their own physical limitations. 


Vira Kray 


Authoritarian Atmosphere 


Table 3 includes the results concerning all 
family atmosphere categories. While the 
results for measurement of Authoritarian 
Atmosphere in doll play showed no signifi- 
cant differences, they were consistently in the 
direction of the accident repeating children 
showing more commands, threats, and pro- 
hibitions than the accident free children. In 
view of Jeffre’s (8) finding that this kind of 
doll play behavior appears to reflect the reality 
of the home environment, these results sug- 
gest that accident repeaters may come from 
an authoritarian atmosphere. 


TABLE 3 


Dott Pray Measures or Famity ATMOSPHERE For Two Twenty-Minute Stanparpizep Dott Pray 
INTERVIEWS 


(N=32 pairs of children) 








AccIDENT 
Free 


ACCIDENT 
REPEATERS 


Dot PrLay 
MEASURES 


SIGMA 


DIFFERENCE 
Mean Dire. 





Authoritarian Atmosphere 
Proportions . 
Time Units 84 
Affection 
Proportions 
Time Units 34 -03 
Solicitance of Affection 
Proportions -020 -020 
Time Units 5.50 5.44 


.006 





* p values are for only one tail of the distribution. 


It is the writer’s feeling that the inhibition 
hypothesis was overweighted in the original 
formulation of this study. The accident re- 
peating children demonstrate more aggressive 
drive, but only somewhat less aggression 
anxiety than the accident free children. They 
showed less inhibitions, in terms particularly 
of tangentiality, and a shorter delay in ex- 
pressing aggression. Some of the evidence 
in the direction of confirming the aggression 
anxiety hypothesis suggests that there may 
be some anxiety surrounding the expression 
of aggression. Although the accident repeat- 
ing children expressed more total aggression 
than the accident free children, a greater per- 
centage of their aggression is verbal rather 
than action in comparison with the accident 
free children. ‘This suggests that there is 
some attenuation of the intensity of their 
expression of aggression. 


Affection and Solicitance of Affection 


The results for Affection and Solicitance of 
Affection show small and statistically insig- 
nificant differences and are not consistent.® 
The results for Solicitance of Affection are in 
the predicted direction, but those for A ffec- 
tion are not. The trends indicate more 
affectionate behavior as well as more solici- 
tance of affection for the accident repeaters 
than for the accident free. One explanation 
of the divergence of findings from prediction 
might be that these measures are not meas- 
ures of reality representation, i.e., they do not 
reflect the child’s accurate perception of his 
family life. Instead they may mirror wish 
fulfillment of needs. When these categories 


® It is possible that these categories suffered most from 
unreliability of scoring as they were most difficult to 
differentiate. The percentages of agreement were lowest 
for Solicitance of Affection. a 
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are considered together, they are similar to 
Sears’ (20) dependency measure. There is 
some support for a wish fulfillment explana- 
tion of these measures in Sperry’s (21) find- 
ing that aggression and dependency maintain 
a balance in doll play. Thus, as the children 
in this study showed increases in aggressive 
behavior from session one to session two, they 
also increased in all behavior having to do 
with affection, possibly as a bid for appease- 
ment or for satisfaction of dependency needs. 


Additional Findings 


The results for additional categories of 
scoring included in part to allow for com- 
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repeaters (see Experimenter Interaction in 
Table 4). 


Family and School History 


Some casual findings about family and 
school history noted while collecting the data 
suggested that the experimental group may 
represent a sample different from the control 
population of five- to eight-year-olds. (Social 
histories were not obtained from families 
because of the promise to withhold the medi- 
cal information used in the selection of acci- 
dent repeaters.) It was noted that while 13 
of the 32 accident repeating children had 
transferred at least once from one school to 


TABLE 4 


AppiTiIonAL Dott Pray Measures ror Two Twenty-Minute Stanparpizep Dott Play INTERVIEWS 


(N=32 pairs of children) 











AccIDENT 
FREE 


AccIDENT 
REPEATERS 


Dott Pay 
MEASURES 


SIGMA 


IFFERENCE 
D Mean Dir. 





Stereotyped 

Thematic 
Proportions 
Time Units 


Nonstereotyped 

Thematic 
Proportions 
Time Units 


+074 
14.65 


Experimenter 


Interaction .046 


+044 


Activity 428.66 71.31 


.048 
7-33 





* p values are for both tails of the distribution. 


plete coverage of doll play behavior are pre- 
sented in Table 4. (Since no hypotheses were 
presented for this material, p is given for two 
tails.) They indicate that the accident free 
children tended to engage in more Stereo- 


typed Thematic Play (representative of 
reality in normal home routines), while the 
accident repeating children showed for time 
units significantly more Nonstereotyped The- 
matic Play (nonreality play, which often 
tended to the bizarre). .The accident repeat- 
ing children also demonstrated more total 
doll play behavior (Activity) than the acci- 
dent free children. These findings cannot be 
explained on the basis of greater incitement 
of the experimental group to activity as the 
experimenter did not interact to a signifi- 
cantly different extent with the accident 


another, only one of the 32 accident free 
children had done so. When these data were 
submitted to the chi-square test of independ- 
ence, a chi square of 12.74 was found, signifi- 
cant at the .or level. While seven of the 
accident repeating children came from broken 
homes, only one of the accident free children 
did. These data yielded a chi square of 5.14, 
significant below the .05 level. 

Of the 22 pairs of public school children 
seen, 9 of the accident repeating children 
were known to the home and school coun- 
selor, while only 2 of the accident free chil- 
dren were so known. The chi square for 
these data is 5.94, a finding which is signifi- 
cant below the .o2 level. 

The accident repeating children tended to 
come from larger families and to be later in 
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birth order. The average number of siblings 
for the experimental group was 2.69 while 
that for the control group was 1.50. The 
average birth order for the experimental 
group was 2.81 while that for the accident free 
group was 1.59. 


Implications of Findings 


The results of the family atmosphere cate- 
gories indicated that the accident repeating 
children showed more commands, threats, 
and prohibitions in doll play than did the 
accident free children, and that they also 
demonstrated more behaviors of the Affec- 
tion and Solicitance of Affection categories. 
Whether we interpret these findings as repre- 
senting the reality of family life for them, 
or, in the case of the Affection categories, as 
representing wish fulfillment for dependency 
needs, it is possible to draw one general con- 
clusion. Accident repeating children reflect 
more intense kinds of interaction between 
doll figures than do accident free children. 
Less of their play is of a stereotyped nature, 
suggesting that they are less reality oriented 
than accident free children. They also 


demonstrate a higher activity level, in agree- 


ment with other investigators’ findings that 
accident repeaters have a high degree of 
physical strength (1, 7). 

These findings are in accordance with the 
indications that accident repeating children 
come from home environments suggesting 
greater social disorganization than accident 
free children. The social findings indicated 
above (that they tend to come from broken 
homes, from families which move around 
often enough so that the children change 
schools relatively frequently, to be later in 
birth order, and to come from larger families) 
suggest less social stability, greater mobility, 
and more social disorganization. That these 
factors should be reflected in disorganized 
play and less play representational of normal 
and family activities is not surprising, if one 
accepts a reality representation explanation 
concerning doll play. 

At the present time, it cannot be conclu- 
sively shown from these results that there is 
any cause and effect relationship between 
personality characteristics and the accident 
repeating tendency. Indeed, it may be felt 
by some that the accident tendency is due to 
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a lack of supervision accompanying the social 
disorganization rather than to the personality 
characteristics resulting from those family 
interactions. It is interesting to note in this 
connection, however, Fuller’s (7) finding that 
accident repeating nursery school children 
tended to get into injury-laden situations in 
a carefully supervised nursery school environ- 
ment. If our findings on the accident re- 
peaters in this study are valid, then greater 
aggression and greater activity may provide 
the motivation for the selection of such situ- 
ations. Further research would need to 
verify the family, social, and personality char- 
acteristics and to study the nature of the 
accident environment more carefully to deter- 
mine to what extent the accidents of accident 
repeaters are “accidental.” 


SUMMARY 


This investigation was designed to test the 
hypotheses that accident prone children have 
greater aggressive drive, as a result of nurtur- 
ance frustration in an authoritarian home 
environment, and greater aggression anxiety, 
as a result of punishment for expression of 
aggression, when compared with accident free 
children. 

Thirty-two children of five to eight years 
of age who had three or more accidents dur- 
ing the prior four-year period were selected 
from the records of five Rochester hospitals. 
Thirty-two accident free children were 
matched with them for age, mental age, and 
sex from the same.schools and grades. 

The results which are reported are from 
the administration of two 20-minute stand- 
ardized doll play interviews. The children 
were asked to make up a story in response to 
a six-room doll house of maximum realism 
and organization. The standard doll family 
included a father, mother, boy, girl, and baby. 
The E’s role was one of minimal interaction. 
Synchronized protocols of recorded verbali- 
zation and of behavior were made. The doll 
play behavior was scored for categories both 
as percentages of total doll play behavior and 
in terms of the number of time units in which 
the behavior occurred. 

The major results found were as follows: 

1. Accident repeaters engaged in signifi- 
cantly more aggression in doll play than did 
accident free children. 
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2. Accident repeaters tended to show less 
inhibitions (tangentiality and need for re- 
assurance) than accident free children. 

3. Accident repeaters showed significantly 
less delay in expressing aggression than acci- 
dent free children. 

4. Accident repeating children showed sig- 
nificantly more verbal aggression than acci- 
dent free children and tended to show less 
action aggression. 

5. Accident repeaters tended to show 
greater increases in aggression and greater 
decreases in measures of aggression anxiety 
from session one to session two than accident 
free children, but the differences were not 
statistically significant. These findings sug- 
gest greater extinction of aggression anxiety 
for the accident repeaters than for the acci- 
dent free children. 

6. The accident repeaters tended to show 
more, but not significantly more, commands, 
threats, and prohibitions in doll play, and 
more instances of affection and solicitance of 
affection than accident free children. 

7. Accident repeaters showed more activity 
in doll play, and significantly more nonstereo- 
typed (unrealistic) thematic play. They 
tended to show less stereotyped (realistic) 
thematic play. 

8. The accident repeaters were noted to 
have come from larger families, to be some- 
what later in birth order, to have come more 
often from broken homes, to have transferred 
schools more frequently, and to be more often 
known to home and school counselors in the 
public schools. 
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N experimentally produced increase in 
A: achievement motivation of male 
college students has been shown to 
produce an increase in the frequency of vari- 
ous kinds if imaginative responses related to 
the motive in thematic apperception (7). 
This provides an empirical basis for using the 
algebraic sum of achievement-related re- 
sponses in thematic stories written by a par- 
ticular person (the n Achievement score) 
as a measure of the intensity of his motivation 
to achieve. The logic of this method of meas- 
uring individual differences in achievement 
motivation is supported by the results of 
studies in which individual differences in 
the n Achievement score have been meaning- 
fully related to differences in such behaviors 
as speed of recognition of achievement-related 
words (8), recall of interrupted tasks (3), per- 
formance on verbal and arithmetic tasks (4), 
level of aspiration, and others (6). The 
present paper presents the results of two suc- 
cessive attempts to determine whether or not 
the kinds of imaginative response which indi- 
cate achievement motivation in men can also 
be used as a basis for inference concerning 
strength of the motive when they occur in 
thematic stories written by women. 

On the basis of evidence already collected 
in studies involving only male subjects (Ss), 
a rudimentary theory concerning the deter- 
mination of the imaginative response from 
which motivation is inferred has been devel- 
oped (6). The implications for the present 
investigation of some of the complexities of 
the problem of achievement motivation in the 
American female can be clarified if treated 
within the framework of this theory. 

The n Achievement score obtained from 
imaginative stories written in response to a 
series of pictures under certain standardized 
conditions (6) is assumed to be a measure of 

1 Based on the undergraduate honors theses of Joseph 
Veroff (11), conducted at Wesleyan University in 
conjunction with Office of Naval Research Contract 
Nvyonr499, and Sue Wilcox (12), at the University of 
Michigan. 


the intensity of the person’s motivation to 
achieve at the time of writing the stories. 
The momentary level of achievement motiva- 
tion is affected by three classes of cues: (a) 
cues of everyday life over which the experi- 
menter (E) has virtually no control; (4) cues 
in the experimental situation immediately 
preceding the administration of the measure 
which are more amenable to experimental 
control; and (c) the more controllable cues 
contained in the pictures presented to elicit 
stories. Pictures are especially chosen to por- 
tray cultural situations in which behavior 
related to the motive might be expected to 
occur. The achievement-motivating effect of 
these various cues will not be the same for 
any two individuals. Individual differences 
in motivation produced by the same cues are 
attributed to differences in previous learning 
associated with them or similar cues. 

In terms of this theory, different intensities 
of achievement motivation are experimentally 
produced by varying the number of achieve- 
ment-motivating cues in the situation imme- 
diately preceding administration of the the- 
matic measure. This is essentially what 
McClelland, et al. (7) did in the experiments 
with male college students in which the 
changes in imaginative content associated 
with increasing intensities of the achievement 
motive were discovered. These experiments 
were conducted in a classroom setting. If an 
attempt is made to produce an experimental 
increase in the achievement motivation of 
women in a similar setting, the problem arises 
of whether or not the achievement cues in 
this setting will have the same motivating 
effect for women as for men. The inference 
can be made fror: the results of an investi- 
gation by Milner (9) that girls are more apt 
to be motivated by the cues of the classroom 
situation than boys. It is reported that adoles- 
cent girls work harder and are more success- 
ful in attaining school grades above their 
tested ability than boys. This suggests that 
the achievement-related learning experiences 
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of males and females in this culture may be 
different enough so that the same cues in the 
situation preceding the measure of n Achieve- 
ment may have a differential motivating 
effect on the two sexes. 

Other considerations lead to the expectation 
that males and females may respond differ- 
ently to the same picture cues. If the measure 
of achievement motivation is to be extended 
to women, it would seem reasonable to 
assume that women might show more evi- 
dences of their motivation in response to 
pictures of females in achievement-related 
activities than to pictures of males. But 
according to Mead (5), when girls reach 
adolescence, there is a change in emphasis 
concerning achievement. At this point in 
the life of a girl it becomes apparent that 
competition with the male is ultimately 
unrewarding. The female role becomes 
more clearly defined as noncompetitive and 
achievement is assigned almost exclusively to 
the male role. If Mead is correct in arguing 
that achievement is not included in the adult 
female role in America, pictures of female 
characters may not be effective in eliciting 
achievement motivation in women. Evidence 


already available shows that when the cues 
in the experimental situation are held con- 
stant, the frequency of achievement-related 
imaginative responses occurring in a story is 
a function of the number of achievement cues 
in the picture (2). 


ExpertMEnt I 


The first investigation to be reported was under- 
taken for the dual purpose of determining: (a) 
whether or not the method of measuring achieve- 
ment motivation developed on male college stu- 
dents is valid in a population of younger male Ss, 
and (4) whether or not it is a valid method of 
obtaining a measure of achievement motivation in 
female Ss. The selection of a mixed high school 
population made possible also an exploratory study 
of possible differential responses of the two sexes 
to both situational cues and pictures containing 
achievement-related cues in which the central char- 
acter was of the same or the opposite sex. 


Method 


Procedure. In general, the procedure was to have 
two groups composed of high school boys and girls 
write thematic stories in response to six pictures, 
three containing male characters and three contain- 
ing female characters. In one group the writing of 
stories was preceded by a period in which E 
attempted by instructions to minimize the number 
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of achievement-motivating cues that might be 
present in the situation. We refer to this condition 
as Relaxed Orientation. In the other group, the 
writing of stories was preceded by certain instruc- 
tions and the performance of certain tasks designed 
to maximize the extent to which achievement cues 
would affect Ss before the writing of stories. We 
refer to this condition as Achievement Orientation? 

Subjects were seniors (aged 16 to 78) in a large 
high school in a New England city (population 
200,000). Twenty-five girls and 23 boys comprised 
the Relaxed Orientation condition. Twenty-six 
girls and 29 boys comprised the Achievement 
Orientation condition. Each group was one home 
room. Since the placing of students in home 
rooms is determined alphabetically at the high 
school, it was assumed that Ss in the two groups 
were randomly selected from the same general 
population. 

Of the six pictures comprising the measure of 
achievement motivation, three containing male 
characters were selected from the group of pictures 
that had already been used in studies involving 
male college students. Three new pictures were 
selected from magazine cuts depicting achievement- 
related situations comparable to those portrayed in 
the pictures of male characters, but in these the 
central character was a female. The six pictures 
were presented in the following order in the 
Relaxed Orientation condition and, because of an 
error in administration, in just the reversed order 
in the Achievement Orientation condition. Since 
an earlier study has indicated that ordinal position 
of pictures in a series arrived at in a random man- 
ner has little, if any, effect on either the total 
n Achievement score or that obtained in response to 
a particular picture, this reverse in the order of 
pictures is thought to have had little effect on the 
general results (2). The designation of each pic 
ture follows its description. 


Two men standing in a room facing each other; one 
man is gesturing (E). 

Woman in front 
papers (3). 

Older man standing, younger man seated at desk, both 
holding a paper (D). 

Older woman with arm around younger woman (2). 

Two men apparently working at machinery in a 
shop (B). 


2We are following Alper’s(1) distinction between 
orientation which refers to what E actually attempts to 
do by means of certain procedures, and the term involve- 
ment which refers to a high degree of concern in the Ss. 
Our use of the label Achievement Orientation rather 
than the more general and ambiguous Ego Orientation 
is based on the belief that the term ego-involvement as 
generally used refers to the state of the person when 
any one of his more important motives has been aroused. 
In these experiments we are attempting to produce ego- 
involvement by engaging achievement motives. Other 
procedures might ego-involve Ss by engaging some other 
motive, ¢.g., need for affiliation (11). 

8 The data of several Ss in each condition are not 
considered in the analysis of results because of their 
(a) failure to identify their sex on data sheets, and 
(4) failure to write at least 25 words in response to any 
picture. 


of man’s desk holding some 
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Woman apparently writing on paper at a desk (1). 


Experimental conditions. The Ss in both con- 
ditions received identical instructions concerning 
the actual writing of imaginative stories. The 
instruction was an adaptation for use with younger 
subjects of that used with college populations (7). 
However, the period immediately preceding the 
instruction for the writing of imaginative stories 
differed in the two groups. 

Students under Relaxed Orientation remained in 
their home rooms for the procedure. The instruc- 
tion preceding the administration was begun by 
the home-room teacher: 

Today during the free period, we are going to be 
given some interesting tests that are being tried out at 
different places. I'd like to introduce (JV), a former 
pupil, now at Wesleyan, who will explain something of 
what you are to do and carry on from there. Pay close 
attention to his explanation. 


The teacher went to the back of the room, and E 
began: 

It feels great to be back at , and I think I am 
going to have an interesting time giving you this test 
this morning. You see, a group of us at Wesleyan are 
engaged in a project of testing imagination or story- 
telling. We are interested in seeing what different kinds 
of stories different people write. I thought I would try 
it out at — I think you will enjoy it, if you give 
it your close attention. 


The students under Achievement Orientation 
were directed to a special projection room in the 
school for the procedure. The instruction preceding 
the story-writing was begun by the home-room 
teacher: 

Today, during the free period, we are to be given 
some tests that have been given to various high school 
students. (RB)* from Wesleyan University is going to 
give you these tests. The Principal is anxious that 
——— do well on the tests. Pay close attention to the 
instructions, so you may do well. 


The teacher remained in the front of the room 
when E gave instructions for an anagrams test that 
had been handed out to the students. At the top 
of the test blank was the word Generation. Spaces 
were marked on the blanks on which the students 
were to write anagrams constructed from the word. 
Directions appeared on the form. The E gave these 
instructions: 

A simple process of testing is being worked out that 
may be a method of determining selections of men and 
women for future leadership roles and other important 
duties. So far, this test has becn given mostly to college 
students. We are now interested in how high school 
students make out on this test. Stucents from (a rival) 
high school have done very well. We would like to see 
what scores you will obtain in comparison. 

Now turn to the sheet handed to you. Put your 
name at the top. Do not start writing until I tell you. 
I will read the directions. Make as many different 
words as you can using only the letters in the word 
Generation. In other words, don't use the letters more 
times than they appear in Generation. You may use 
long words or short words and may include the names 
of persons, places, and slang words. Does everyone 


#We wish to acknowledge the assistance of Robert 
Birney in conducting the experiment. 
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Keep writing until I tell you to stop. 
Try to get as many as 


understand? 
You will have three minutes. 
you can. Gol 

The test was timed for three minutes. The E did 
everything he could during this period to heighten 
the achievement motivation of the Ss with respect 
to the task they were to perferm for three minutes. 

Following these preliminary procedures, instruc- 
tion for the measure of achievement motivation 
itself was presented. The Ss were told that it was 
a test of creative imagination or story-telling; that 
a number of pictures would be projected on the 
screen; that they would have twenty seconds to 
look at the picture, then about four minutes to 
write a story about it. They were informed that 
there were no right or wrong answers and that 
they might feel free to write any kind of story 
that they might choose. They were urged to try 
to make them interesting and not merely to describe 
the picture they saw. Furthermore, they were 
urged not to worry about grammar or spelling, but 
simply to tell as interesting a story as they could. 

The pictures were then presented in order. A 
picture was projected on the screen for 20 seconds, 
followed by a four-minute period in which Ss wrote 
about that picture. At the end of four minutes a 
second picture was shown, and so on. 


Scoring 


The original procedures for scoring thematic 
stories for n Achievement were reported in 1949 
by McClelland, et al. (7). Since then, several 
refinements have been made in the procedures for 
scoring. All of the stories in this study have been 
scored in terms of the refined criteria and method 
for obtaining the n Achievement score reported in 
McClelland, et al. (6). The n Achievement scores 
based on six stories were obtained for the 92 Ss of 
the present study on two occasions, with a rank- 
order correlation of .95 between these two scorings 
of the same stories. 


Results 


Two n Achievement scores were obtained 
from the stories written by each S—one from 
three stories written in response to pictures 
containing female characters, and another 
from three stories written in response to pic- 
tures containing male characters. Table 1 
shows that the mean n Achievement score of 
the boys obtained from stories about pictures 
containing male characters increases from the 
Relaxed Orientation to the Achievement 
Orientation condition. There is no evidence 
of an increase in the n Achievement score 
obtained for them from stories about pictures 
containing female characters. The n Achieve- 
ment score of the girls in response to pictures 
containing male characters, however, does not 
change from the Relaxed to the Achievement 
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TABLE 1 


Mean n AcHIEVEMENT Scores oF HicH Scnoot Boys ano Grats Ostainep FROM Stories WRITTEN IN 
Response To Pictures oF MALE AND FEMALE CHARACTERS UNDER RELAXED AND 
ACHIEVEMENT ORIENTATION CONDITIONS 








RELAXED ORIENTATION 





3 MALE 
PicTURES 


M SD M 


SuByEcTs 


3 FEMALE 
PicTuRES 


ACHIEVEMENT ORIENTATION 





3 FEemMate 
PicTuREs 


M SD 


3 MALE 
PicTURES 


—_— 





Boys 
Girls 


3.32 1.72 


4-02 


1.94 
5.72 


3.40 
1.77 2. 


Qo : 2. 1.92 


2.66 
3-30 


1.57 





Orientation condition. It is relatively high 
under both conditions. Neither is there any 
evidence of a change in the n Achievement 
score obtained from stories about pictures 
containing female characters. Finally, there 
is evidence that both boys and girls respond 
differentially to pictures containing male and 
female characters. The greater achievement- 
related responses are made to pictures con- 
taining male characters. 

Separate analyses of variance were made of 
the boys’ and the girls’ n Achievement scores 
(Table 2). The upper part of Table 2 shows 
that for the boys, differences attributable to 
pictures or to conditions alone are not sig- 
nificant. Differences attributable to the 
interaction of the pictures with conditions, 
however, are significant at the .05 level of 
confidence. There is no difference between 
the boys’ mean n Achievement score on the 
male pictures (1.94) and their mean n 
Achievement score on the female pictures 


(1.72) in the Relaxed Orientation condition. 
When motivation is experimentally increased 
by increasing the achievement-arousing cues 
in the situation prior to writing stories, the 
Achievement Orientation condition, the boys’ 
mean n Achievement score on the male pic- 
tures (4.93) is significantly higher than their 
mean n Achievement score (1.94) following 
Relaxed Orientation (S.E.diff.=1.08, t=2.77, 
df 44, p<.o1). There is, however, no in- 
crease in response to pictures of female 
characters. 

From these results we draw two conclu- 
sions. First, stories written in response to 
pictures of male characters do discriminate 
between two different experimentally pro- 
duced intensities of achievement motivation 
in high school boys, a result comparable to 
the experimental finding with college male 
students (7). This positive finding provides 
both evidence of the validity of the scoring 
system for younger male Ss and assurance 


TABLE 2 
ANALYsIs OF VARIANCE oF Boys’ AND Girts’ n ACHIEVEMENT Scores Oxstrainep FROM Stories WRITTEN 
in Response To Pictures CoNTAINING MALE AND FEMALE CHARACTERS UNDER RELAXED 
AND ACHIEYEMENT ORIENTATION CONDITIONS 








Source 


Sum oF SQUARES af 


EsTIMATED VARIANCE 





Pictures 

Conditions 

Pictures X Conditions 
Error 


Total 


Pictures 

Conditions 

Pictures X Conditions 
Error 

Total 


Boys 

104.11 
43-69 
53-91 
11.34 


Giris 


301.75 
-03 
1.64 
12.09 
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that the procedures employed were sufficient 
to produce different motivational intensities. 
Second, stories written in response to pictures 
of female characters do not discriminate be- 
tween two different experimentally produced 
intensities of achievement motivation in high 
school boys. 

The analysis of variance for the n Achieve- 
ment scores of the girls is summarized in the 
lower part of Table 2. It can be seen that for 
the girls, differences attributable to pictures 
alone are significant at the .or level of confi- 
dence. Differences attributable to conditions 
alone or to the interaction of pictures with 
conditions are not significant. 

The differential response to pictures con- 
taining male and female characters that was 
evident in the stories of the boys written 
under Achievement Orientation is consist- 
ently shown by the girls in both the Relaxed 
and the Achievement Orientation conditions. 
The mean n Achievement score obtained 
from pictures containing female characters is 
significantly lower than that obtained from 
pictures containing male characters. This 
particular result is consistent with Mead’s 
discussion of the differentiation of sex roles 
in the culture to which the Ss belong (5). 
Along with the absence of any evidence of 
an increase in the girls’ n Achievement score 
when attempts were made to produce an 
experimental increase, it raises the question 
of whether achievement-related responses in 
the stories of the girls can be considered evi- 
dence of motivation to achieve or whether 
their achievement responses simply reflect 
awareness of the cultural definition of male 
and female roles and are essentially unrelated 
to motivation in the girls. 

A review of Table 1 suggests that our prob- 
lem is not so much to account for a lack of 
increase in the n Achievement score of the 
girls from the Relaxed to the Achievement 
Orientation cendition as to account for the 
fact that the mean n Achievement score of 
the girls is already high in the Relaxed con- 
dition. Actually the girls’ mean n Achieve- 
ment score on the male pictures is higher, 
though not significantly so, under both 
conditions than the boys’ n Achievement 
score under Achievement Orientation when 
motivation has presumably been strengthened 
in them by experimental procedures. In the 
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Relaxed Orientation condition, the girls ob- 
tained a significantly higher mean n Achieve- 
ment score (5.72) in response to pictures of 
male characters than the boys (1.94). The 
difference, 3.78, is significant at the .o1 level 
of confidence (S.E.diff.=1.17, t==3,.23, df 38). 
This result is in accord with the rather gen- 
erally accepted notion that girls usually tend 
to be more highly motivated in the class- 
room situation than boys. One of Milner’s 
observations (9), that the accomplishment of 
a specific task is satisfying for the adolescent 
American girl, suggests the achievement 
motivation of girls may be elicited without 
the type of Achievement Orientation proce- 
dures that are necessary to heighten the 
achievement motivation of boys in classroom 
situations. Any testing situation, especially 
when there are men around to serve as addi- 
tional competitive cues, may serve to raise the 
level of achievement motivation in women 
of this culture. 

Additional support can be given the argu- 
ment that the n Achievement scores obtained 
from stories written by girls in response to 
the pictures of male characters in the Relaxed 
Orientation condition actually reflect a high 
level of achievement motivation. A compari- 
son can be made of the relative frequencies 
of various kinds of achievement-related re- 
sponse in the stories of the girls in both con- 
ditions and the boys in the Achievement 
Orientation condition. When the 13 cate- 
gories of imaginative response that contribute 
to the over-all n Achievement score (viz., 
Need, Instrumental Activity, Anticipatory 
Goal States, Obstacles, etc.[7]) are ranked 
according to their frequency in the stories 
written by boys and girls and the distribution 
of ranks obtained from the boys’ stories is 
compared with that obtained from the girls’ 
stories, the correlation is 81 (p<.o1) for 
relaxed girls with achievement-oriented boys, 
and .o1 (p<.o1) for achievement-oriented 
girls with achievement-oriented boys. This 
means that the pattern of achievement-related 
thoughts reflected in the thematic appercep- 
tion of girls in both conditions is like that of 
the boys in the Achievement Orientation 
condition. 

We will accept, tentatively, the hypothesis 
that the two experimental conditions which 
had a differential effect on the achievement 
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motivation of the male Ss were actually not 
different for the female Ss. That is, we 
assume there were a sufficient number of 
achievement-motivating cues for the girls, 
even in the Relaxed Orientation condition, to 
raise their level of motivation to a relatively 
high point and thus destroy the possibility of 
finding an increase when further achievement 
cues were added. 

We may turn now to the question of the 
differential response of the girls to pictures 
containing male and female characters. 
According to our present hypothesis, there is 
no more reason to expect an increase in 
n Achievement score obtained in response to 
pictures containing female characters than 
there is for expecting an increase in response 
to pictures containing male characters. The 
significantly greater n Achievement response 
to pictures containing male characters may 
mean either that the male character in an 
achievement-related situation is a more 
powerful elicitor of achievement-related 
thoughts than the female character in a 
similar situation as a result of the cultural 
differentiation of sex role regarding achieve- 
ment we have already mentioned, or that the 
attempt to select pictures containing female 
characters in achievement-related situations 
was not very successful. While an effort was 
made to choose pictures containing female 
characters having as many achievement- 
related cues as those already contained in the 
male pictures, we have no information, other 
than our own judgment, concerning the cue 
value of the pictures independent of the 
achievement-related response to them. An 
earlier study (2) has shown that male college 
students obtain a significantly higher n 
Achievement score in response to pictures of 
men in actual work situations suggesting 
achievement than in response to pictures con- 
taining male characters but no apparent 
achievement-related cues. This would sug- 
gest that before attempting to interpret the 
difference in response of the girls to pictures 
containing male and female characters, 
another attempt ought to be made to select 
pictures containing female characters having 
even more cues suggestive of achievement. 

One further bit of evidence indicates that 
the achievement-related responses of girls, 
whether elicited by pictures containing male 
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or female characters, may be taken to mean 
the same thing. The separate distributions 
of n Achievement scores obtained from male 
and female pictures were divided as near to 
the median as possible. In the Relaxed Con- 
dition, 8 of 10 high on male pictures were 
also high on female pictures, while 8 of 12 
low on male pictures were also low on female 
pictures (x?==4.79, p<.05). Similarly, under 
Achievement Orientation 8 of 10 high on 
male pictures were high on female pictures, 
while 11 of 14 low on male pictures were alse 
low on female pictures (x*=8.06, p<.or). 


ExperiMENT II 


A second experiment was designed to in- 
vestigate further the questions raised by the 
results of our initial attempt to study achieve- 
ment motivation in women. The working 
hypotheses derived from our first attempt 
defined the changes in procedures. First, it 
was assumed that the n Achievement score 
obtained from stories written by girls actually 
did reflect the intensity of their motivation 
to achieve at the time of writing the stories. 
According to this hypothesis, the Relaxed 
Orientation condition of the earlier experi- 
ment did not reduce achievement cues for 
girls to the same extent that it did for the 
boys. The classroom situation, the presence 
of boys in the class, a male experimenter, and 
reference to the stories as a test of creative 
imagination all were possible achievement 
cues for females. If they did act as achieve- 
ment cues, removal of these cues from the 
situation in which achievement motivation 
is measured should have the effect of lower- 
ing the n Achievement score. 

In order not to be left with an indeter- 
minate result should this second attempt to 
create a low motivation condition for female 
Ss fail to produce the expected decrease in the 
n Achievement score, a second basis was 
introduced to test the hypothesis that the 
n Achievement score obtained from stories 
written by women is a valid measure of 
achievement motivation. In the Achieve- 
ment Orientation condition of this second 
experiment, the anagrams test to be per- 
formed before the measure of achievement 
motivation was lengthened to 12 minutes in 
order to provide a fairly reliable performance 
measure of achievement motivation. Num- 
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ber of anagrams produced during the 12- 
minute period can be related to the n 
Achievement score obtained from the stories 
written immediately after this 12-minute 
period. If the measure obtained from the 
stories is actually a measure of achievement 
motivation, then the higher the n Achieve- 
ment score the greater should be the number 
of anagrams completed in the 12-minute 
period. This type of relationship with effort 
expended on simple verbal and arithmetic 
tasks has already been demonstrated with 
male college students by Clark and McClel- 
land ® and Lowell (4). 

The assumption that the differential re- 
sponse to pictures containing male and fe- 
male characters might have been due, at 
least in part, to our failure to select pictures 
containing female characters having enough 
achievement-related cues led to a second 
selection of pictures containing female char- 
acters. Every effort was made to select pic- 
tures having achievement cues equivalent to 
those in the male pictures. 


Method 


The extent to which achievement-related cues 
were removed in the Relaxed Orientation condition 
of the second experiment while being maintained 
in the Achievement Orientation condition is evident 
in a description of the two conditions. 

Relaxed Orientation condition. The Ss were 27 
female University of Michigan students visited in 
small groups in the dormitory after classes in the 
evening. There were five groups ranging in size 
from 2 to 10. Of necessity, all 27 were volunteer Ss. 

The E (SW), a female college student, tried to 
be relaxed, friendly, and to treat the experiment 
itself as of no particular importance. She talked 
with the girls as they gathered and was in no hurry 
to start the experiment. When all were present, 
the following instructions were given: 


I am doing a paper on thought processes and need 
some data to get an idea for it. So I would like you 
to write some imaginative stories about the pictures I 
am going to. hold up. I'll show a picture for around 
twenty seconds. Then you'll have about five minutes 
to write your story. Use a separate page of your book- 
let for each story. The questions on the sheets are to 
guide your thinking and enable you to cover all the 
elements of the plot in the allotted time. I'll keep time 
and tell you when to go on to the next question. Plan 
to spend about a minute on each question. There are 
no right or wrong answers involved, so feel free to 
write whatever you want. You don’t have to sign your 
name. Are there any questions? If you need more 


5Crarx, R. A., & McCretranp, D. C. A factor 
analytic integration of imaginative performance and case 
study measures of the need for achievement. Unpub- 
lished paper, Wesleyan Univer., 1950. 
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space, use the back of the page. All right, here's the 


first picture. 


It is assumed that having a female E visit the Ss 
in an informal way in a dormitory room in which 
no men are present, describing the experiment as 
“a study of thought processes” rather than as a “test 
of creative imagination,” not having Ss bother 
to sign their names, and the friendly, casual attitude 
on the part of E constitute a condition containing 
relatively few n Achievement-arousing cues. 

Six pictures were then presented in the following 
order; their designations follow the descriptions: 

Older man standing, younger man seated at desk, 
both holding a paper (D). 

Woman typing seated at a desk (a). 

Two men standing in a room facing each other; one 
man is gesturing (E). 

Man and woman facing each other in a library (b). 

Two men and a woman gathered around another 
woman seated at desk (c). 

Two men apparently working at machinery in a 
shop (B). 

After the sixth story was completed, E said, 
“Well, that’s all and I'd appreciate it if you would 
keep the details of this study confidential. I'll be 
working with other subjects later, and I’d rather 
they didn’t know anything about it. Thanks a lot 
for your time and cooperation.” 

Achievement Orientation condition, The Ss were 
the 26 female students in an introductory psychol- 
ogy class of men and women at the University of 
Michigan. The experiment was conducted during 
a class hour. Before the class began E did not 
mingle with the class. When the class had con- 
vened, the following instructions were given: 


Since Mr. will not be here today, I have 
arranged to use this period for a study of creativity, 
which has been shown to be an aspect of intelligence. 
(At this point E passed out a sheet of paper to each S.) 
The first task will be to make as many words as possible 
out of a word that I will write on the board. For 
example, if the master word were Washington, possible 
smaller words would be was, as, and so on. After each 
minute I will say “check” and you are to put a check 
after the last word you have completed. Indicate the 
last word completed at the time I say “check” and go 
on working.6 We believe that there are some sex 
differences in this mental ability, so please write your 
name and sex at the top of your sheet. Do this now. 
Please do your best, as we are interested in seeing how 
well people can perform when they are working at 
maximum efficiency. Work rapidly and try hard. Are 
there any questions? Here is the master word. 


The female experimenter then wrote the word 
Generation on the blackboard. 


Write down all the smaller words contained in 
Generation that you can find. 


The task was continued for 12 minutes, E saying 
check after each minute at which time Ss put a check 
after the last completed word. Then instructions 
to put name and sex on the paper were repeated 
and papers collected. The imaginative story 
blanks were passed out, and the following instruc- 
tions read: “This is a test of creative imagina- 


6 Op. cit. 
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TABLE 3 
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tion. . (From this point on the instruction 
was the same as that used in previous studies [7].) 

It is assumed that the classroom situation includ- 
ing male students, description of the experiment as 
a study of creativity with the stories presented as a 
test of creative imagination, an instruction to do 
one’s best and actual performance on an anagrams 
test which is related in some way to intelligence, 
the request to write down one’s name and sex, 
constitute a condition containing relatively many 
achievement-motivating cues. 

The procedure for administration of the pictures 
was identical with that of the Relaxed Orientation 
condition. 


Scoring. The stories written under both condi- 
tions were randomized and scored for achievement 
references according to the most recent revision of 
the n Achievement scoring procedure (6). Previous 
to this analysis the scorer (SW) had scored the four 
story records of 30 male Ss to check her scoring 
reliability against that of scorers with more experi- 
ence. A scoring reliability of .95 was obtained 
before analyzing the stories of this study. 


Results 


The mean n Achievement scores obtained 
by college women in response to pictures 
having male and female characters in the two 
experimental conditions are listed in Table 3. 
Once again there is little evidence of an in- 
creased n Achievement score between the Re- 
laxed and Achievement Orientation condi- 
tions. As in the earlier study, the mean n 


Achievement score in response to pictures 
having male characters is greater than that 
in response to pictures having female char- 
acters, 

The analysis of variance, Table 4, indicates 
that the difference between mean n Achieve- 
ment scores obtained from pictures having 
male characters and those having female 
characters is significant at the .o1 level of 
confidence. Despite the effort that was made 
to remove achievement-related cues in the 
Relaxed situation in order to be able to show 
an increase in n Achievement score from the 
Relaxed to Achievement Orientation condi- 
tion, and despite efforts to select pictures 
containing female characters in situations 
clearly suggestive of achievement, the results 
are virtually identical with those of the earlier 
experiment. It would appear that the differ- 
ential response to pictures containing male 
and female characters by both male and fe- 
male Ss must be accounted for in terms of 
the difference in the definition of male and 
female roles. Striving for achievement is 
without question identified with the male 
role in our data. Paradoxically, this means 
that if the imaginative achievement-related 
responses of female Ss are expressions of their 
own achievement motivation, these evidences 


TABLE 4 


ANALYsis OF VARIANCE oF CoLLeceE Women’s n ACHIEVEMENT Scores OBTAINED FROM Stories WRITTEN IN 
Response To Pictures oF MALE AND FeMALe CHARACTERS UNDER RELAXED 
AND ACHIEVEMENT ORIENTATION CONDITIONS 
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of motivation to achieve appear predomi- 
nately in stories about persons of the opposite 
sex. 

Several alternative explanations may be of- 
fered to account for this second failure to 
create a low motivation condition in female 
Ss. First, it might be argued that the experi- 
mental conditions still did not effectively 
alter the achievement cues in the situation for 
women. If the fault lies in the experimental 
condition, the weakness would seem to be 
that the college environment itself contains 
so many cues that are achievement-related for 
women that anything remotely resembling a 
test situation in this environment arouses 
achievement motivation in them. 

Second, it might be argued that this second 
Relaxed Orientation condition did reduce the 
achievement cues and would have produced 
a lower mean n Achievement score in a more 
representative sample of women. But col- 
lege women, as a group, are so highly moti- 
vated that the effect of our reduction of cues 
was negligible. Another form of the same 
argument would direct attention to the possi- 
bility that a volunteer group, as the Relaxed 
group was, might be more highly motivated 
than the more representative classroom sam- 
ple in the Achievement Orientation condi- 
tion. But the request for volunteers was de- 
liberately presented in a way that should 
have minimized the extent to which differ- 
ences in achievement motivation might deter- 
mine willingness to volunteer. The experi- 
ment was not, in other words, presented as 
an achievement-related task. It is probably 
unlikely that the two samples were different 
in any essential respect. 

A third alternative would be to reject the 
hypothesis that the n Achievement scores 
obtained actually reflect the achievement mo- 
tivation of women and to entertain the alter- 
native hypothesis that motivation in wornen 
is manifested in responses not now covered 
by the scoring procedures developed for men. 
The fruitfulness of this third suggestion can 
be estimated better after considering the re- 
lationship between the n Achievement scores 
obtained for the female Ss and their perform- 
ance on the anagrams task which was intro- 
duced into this study as an independent be- 
havioral measure of achievement motivation. 


Relation of n Achievement Score to Ana- 
grams Performance 


The anagrams task was presented in the 
Achievement Orientation condition both as 
a means of providing achievement cues and 
as a performance measure of n Achievement. 
The Ss with high n Achievement should 
work harder and therefore produce more 
anagrams in the 12-minute period allotted 
than Ss with low n Achievement. A positive 
relationship, in other words, would be evi- 
dence of the validity of the measure when 
applied to female Ss. 

The Ss in the Achievement Orientation 
condition were first divided into high and 
low n Achievement groups on the basis of 
scores obtained from the male pictures alone, 
then on the basis of scores obtained from the 
female pictures alone, and finally on the basis 
of the total score on all pictures. In each in- 
stance, the distribution of scores was divided 
as near to the median as possible in order to 
provide equal numbers in the high and low 
n Achievement groups. These divisions were 
made on the assumption that an achievement- 
related response in stories about either male 
or female pictures is evidence of achievement 
motivation. The earlier finding that a high 
score in response to one kind of picture is 
associated with a high score on the other was 
confirmed. Scores of 53 Ss in the Relaxed 
and Achievement Orientation conditions 
were thrown into two distributions, one for 
male pictures and one for female pictures. 
Both measures agreed in placing 35 of the 
Ss in either the upper or lower half of the 
distribution of scores (x*=5.61, df 1, p<.o1 
in the predicted direction). 

Table 5 shows the mean number of ana- 
grams produced during each two-minute pe- 
riod by high and low n Achievement groups 
according to each of these three measures. 
The relationship between the imaginative 
measure of n Achievement and the anagrams 
performance measure is essentially the same 
whether the n Achievement scores were ob- 
tained from pictures containing male char- 
acters or female characters. 

The combined measure, based on six rather 
than three stories and therefore the most reli- 
able, produces the most clear-cut relationship 
with anagrams performance. Except in the 
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first two-minute period, the high n Achieve- 
ment group consistently produced more 
words than the low n Achievement group. 
The difference in mean total word output (in 
the predicted direction) is significant at the 
05 level of confidence. The difference dur- 
ing the second two-minute period (in the 
predicted direction) is significant at the .o2 
level of confidence. These results are similar 
to those obtained by Clark and McClelland * 
and McClelland et al. (6) in a similar study 
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words are available. Here, differences due 
to motivation are minimized since little ef- 
fort is demanded. However, as fewer words 
remain to be found, the task becomes in- 
creasingly more difficult. Only then do dif- 
ferences in strength of motivation become 
apparent in differences in output. This dif- 
ference in motivation is first evident in the 
second two-minute period. As a result of 
having produced more words in the second 
two-minute period than the low n Achieve- 
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of n Achievement and anagrams perform- 
ance in men. They also found the relation- 
ship between anagrams output and n Achieve- 
ment score to be clearest during the middle 
portion of the task. In both studies, the high 
n Achievement group performed significantly 
better when the task became more difficult. 
Clark and McClelland’s explanation of this 
result adequately handles the present data 
and can be paraphrased here. During the 
first period the task is relatively easy. It 
isn’t difficult to make up words at the start of 
an anagrams test because so many possible 


T Op. cit. 


ment group, the high n Achievement group 
finds itself with an even harder task during 
the remaining periods since now there are 
relatively fewer words left to be found. In 
spite of this, they still produce more words 
than the low n Achievement group in every 
period of the task. Toward the end of the 
12-minute period in particular, the high n 
Achievement group, having fewer words left 
to find, has a much more difficult task than 
that of the low n Achievement group. This 
difference in the difficulty of the task presum- 
ably accounts for the lack of significance of 
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differences in the output of the two groups in 
the latter periods of the task. 

Unless this relationship is a function of 
differences in verbal ability operating in both 
the imaginative measure and in the anagrams 
task, it provides the first empirical evidence 
of the validity of the n Achievement scoring 
method developed on men for obtaining a 
measure of the achievement motive in 
women. Should there be a positive correla- 
tion between number of words written in the 
imaginative stories and n Achievement score, 
one might wonder whether the n Achieve- 
ment measure and the anagrams task were 
merely two measures of verbal fluency. The 
rank-order correlation between total number 
of words written on the imaginative stories 
and the total n Achievement scores of the 26 
Ss in Achievement Orientation condition is 
insignificant (rho=.08). 

The absence of any evidence to suggest that 
the relationship between n Achievement score 
and performance on anagrams can be ac- 
counted for in terms of differences in verbal 
fluency, and the similarity of this result with 
those of comparable studies using male col- 
lege students tends to confirm the hypothesis 
that the n Achievement score obtained from 
stories written by women is actually a meas- 
ure of achievement motivation. 

This positive result enables us to focus more 
clearly on the most pressing questions for 
future research. The general hypotheses pro- 
posed to account for the high motivation of 
female Ss under apparently relaxed condi- 
tions require both theoretical development 
and firm empirical anchorage. 

The tendency of both male and female Ss 
to give evidence of their motivation to achieve 
almost exclusively in thematic stories about 
men (a) indicates the promise of thematic 
apperception as a method of investigating 
role phenomena, and (4) calls into question 
some of the assumptions we often make con- 
cerning whom the subject identifies with in 
thematic apperception. 


SUMMARY 


The experimental procedure for producing 
an increase in achievement motivation and a 
measurable difference in the mean n Achieve- 
ment score obtained from thematic appercep- 
tion stories in male college students is shown 
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to produce an increase in the mean n Achieve- 
ment score of younger high school age male 
Ss. From this it is concluded that the scor- 
ing procedures developed in studies of male 
college students are applicable to a younger 
male population. 

The same experimental procedure does not 
produce an increase in the mean n Achieve- 
ment score of female high school students. 
Their mean n Achievement score is suffi- 
ciently high under both Relaxed and Achieve- 
ment Orientation conditions to suggest high 
achievement motivation in female Ss in both 
conditions. 

A second attempt to remove all possible 
achievement-related cues and create a low 
motivation, or Relaxed condition, with female 
college students failed to produce a decrease 
in the mean n Achievement score. However, 
despite this failure to show an experimentally 
produced difference in mean n Achievement 
score of female Ss, the relationship of n 
Achievement scores obtained from stories 
written by college women to their perform- 
ance on an anagrams task was found to be 
virtually identical to that obtained for men 
by Clark and McClelland.* This finding 
provides evidence that the scoring procedures 
developed in studies of college age males are 
also applicable to stories written by female 
Ss as a means of obtaining a measure of 
achievement motivation. 

Both male and female Ss respond differ- 
entially to pictures containing male and fe- 
male characters. In both cases, there is a 
greater achievement-related response to pic- 
tures containing male characters. This find- 
ing, which is consistent with Mead’s treat- 
ment of the differentiation of sex role con- 
cerning striving for achievement, indicates 
that both male and female Ss of this culture 
express their motivation for achievement pri- 
marily in imaginative stories in which the 
central character is a male. Of perhaps more 
importance than this substantive result is the 
clear implication that thematic apperception 
promises to be a fruitful avenue for empirical 
exploration of role phenomena. 
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new psychological testing procedure, ma- 

jor effort should be directed towards 
studying the relations between the relatively 
unknown instrument and various external 
criteria or measures of real-life behavior. 
Such efforts are being made or have been 
made concerning the leaderless group discus- 
sion (2, 5, 6, 11, 14, 15, 18, 24, 25). However, 
another approach which may lead to in- 
creased understanding of a particular meas- 
urement technique is to study its relations 
with other psychological testing procedures 
about which relatively more is known. 

The second approach has also been fol- 
lowed (2, 4, 9, 12, 14, 23, 25). For example, 
evidence was reported previously concerning 
the relation between verbal aptitude as meas- 
ured by the American Council on Education 


I’ assessing the measuring properties of a 


Psychological Examination (ACE) or the 


Ohio State Psychological Examination 
(OSPE) and LGD behavior. Bass (4) found 
a correlation of .33 between the OSPE and 
LGD ratings, while Coates (14) reported a 
correlation of .31 between total ACE scores 
and LGD ratings. Wurster and Bass (25) 
noted a correlation of .32 between the linguis- 
tic (L) scores of the ACE and LGD ratings 
in contrast to a correlation of .17 between the 
quantitative (Q) scores of the ACE and LGD 
ratings. 

Some evidence was also obtained concern- 
ing the consistency of LGD ratings with 
other leadership assessment devices. Over-all 
ratings of potential as future salesmen or 
supervisors based on individual interviews 
correlated .45 with ratings based on the LGD 
(4). Arbous and Maree (2) found a similar 
relationship between behavior during leader- 
less discussions and behavior of the same can- 
didates when assigned leadership roles, while 
Carter (9) found some consistency of be- 
havior of subjects (Ss) placed in different 
types of leaderless situations. 

However, to date, little quantitative evi- 


1 Parts of this study were aided by a grant from the 
Louisiana State University Graduate Council on Research. 


dence has been reported concerning person- 
ality factors which might be related to LGD 
behavior although Carter and Nixon (12) 
have obtained matrices of intercorrelations 
between other types of situational tests and 
paper-and-pencil tests of ability, interest, and 
personality while Taft’s (23) exploratory, 
qualitative study considered the relations 
between LGD assessment, the Rorschach, and 
other assessment devices. 


PuRpPosE 


The purpose of the present study is to re- 
port some evidence concerning the relations 
between ratings of participants’ behavior dur- 
ing the leaderless group discussion—which in 
several studies (2, 5, 6, 11, 14, 15, 18, 24, 25) 
has been found predictive of future leader- 
ship status in a number of situations—and as- 
sessments of the participants’ personality by 
the Rorschach test, by personal interviews, by 
the Guilford-Zimmerman Temperament Sur- 
vey (16), and by the University of California 
Public Opinion Study F or Authoritarian 
Scale (1). Since the LGD has been found 
to measure leadership potential fairly well, it 
was hypothesized that personality variables 
which tend to differentiate those of high 
from those of low leadership status would 
likewise tend to differentiate those who 
would score high on the LGD, a predictor 
of that leadership status, from those who 
score low. 

In his extensive survey of the literature, 
Stogdill (22) found a number of personality 
characteristics which were associated posi- 
tively with leadership status in a wide variety 
of situations. Characteristics included intel- 
ligence, dependability, general activity, social 
participation, sociability, initiative, persist- 
ence, self-confidence, cooperativeness, adapta- 
bility, verbal facility, judgment, aggressive- 
ness, ambition, and sense of humor. It was 
hypothesized that some of these traits—as in- 
dicated by Rorschach test assessment, by ap- 
praisal of attitudes towards oneself and others 
as revealed during a personal interview, and 
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by inventoried traits as disclosed by the GZ 
Temperament Survey or F Scale—would be 
associated with ratings of performance on 
the LGD. If this were demonstrated, then, 
additional indirect evidence would be pro- 
vided to suggest that ratings of leaderless 
group discussion behavior were indicative of 
leadership potential in a wide variety of situa- 
tions. The investigation would also help to 
determine what LGD assessment had in com- 
mon with other approaches to personality as- 
sessment. The importance of this determina- 
tion has been emphasized by McClelland (19) : 


The notion of validity assumes that there is one 
way of measuring (defining) a construct which is 
better than (or “truer” than) any other measure. 
All other measures can then be tested for validity 
against this standard measure. .. . But in person- 
ality theory there is often no basis for arguing that 
any ome measure is better than any other... 
(Rather), a concept is useful theoretically (1) to 
the extent that the person using it makes explicit 
its operational meanings and (2) to the extent that 
those operations cut across and are confirmed in 
different areas of behavior (19, p. 157). 


As it is used in this article, leadership po- 
tentiai, the concept in question, refers to the 
extent to which a person is likely to adopt 
and play the role of leader successfully in a 
wide variety of situations. The leader of a 
designated group may be defined as the mem- 
ber (or members) whose behavior results in 
the progress of his group in a shared direc- 
tion. While it is recognized that an individ- 
ual’s potential as a leader will depend to some 
extent on the specific situation and nature of 
the group to be led, it is assumed that there 
are individual differences in leadership po- 
tential above and beyond the situational vari- 
ations, i.c., which cannot be accounted for 
by variations from situation to situation in 
what is deemed adequate leader behavior. 
This assumption is in line with Stogdill’s re- 
port, which indicates that it is necessary to 
consider the interaction of situational and 
personality factors when attempting to ac- 
count for variations in the adequacy of leader 
behavior. The influence of situational varia- 
tions obviously will depend on the range of 
situations one wishes to include in a particu- 
lar study. The situational component will 
become extremely important if one includes 
in a given analysis of individual differences 
in leader potential, leaders from such diverse 
groups as clans of Australian aborigines, staffs 


of U.. S. Army officers, DAR clubs, and 


gangs of juvenile delinquents. On the other 
hand, if in a particular analvsis, one narrows 
the range to leaders of college students only, 
to leaders of verbal situations only, or to 
leaders of social situations only, it is also 
obvious that personality variations (or varia- 
tions associated with the individual) will ac- 
count for most of the variance in the ade- 
quacy of leader behavior. 

The rationale of the present report is this: 
Where situational variations are reduced to 
a minimum, personality variations will ac- 
count for much of the variation in leadership 
potential. Evidence to support the hypothe- 
sis that the LGD can be used as a means of 
measuring leadership potential will be pro- 
vided if those individuals identified by the 
LGD as having high leadership potential are 
similarly identified by other assessment tech- 
niques such as the Rorschach or the G-Z 
Temperament Survey as having high leader- 
ship potential or as having the characteristics 
commonly found among leaders in a wide 
variety of situations. This study will also 
suggest the extent to which LGD assessment 
is consistent with other approaches to assess- 
ing personality. 

The relations of assessment based on the 
LGD with the Rorschach test, with analyses 
based on a personal interview, with the 
Guilford-Zimmerman Temperament Survey, 
and with the F scale each will be considered 
in a separate section. 


RorscnHacn Correcates or LGD Benavior 


Procedure 


One hundred and forty college girls, 20 of whom 
were pledges or members in each of 7 sororities, 
volunteered to undergo three hours of psychologi 
cal testing. Each of the 20 groups was composed 
of one representative of each sorority. This was 
done to reduce familiarity among group members 
to a minimum. 

As part of the assessraent program, reported in 
more detail elsewhere (15, 17), each of the 20 
groups was subjected to an initially leaderless group 
discussion of 30 minutes’ duration, The instructions 
and administrative details of this technique have 
been described elsewhere (3, 5, 6, 15, 17). Two 
trained observers used a standardized checklist to 
rate the extent to which girls attained leadership 
status during the initially leaderless discussion. 
The correct split-half reliability of the total rating 
based on the correlation between the ratings of 
both observers was .87. Following each discussion 
the highest or lowest ranked participant in attained 
leadership status was administered a Rorschach test. 

Whether the girl selected for Rorschach assess- 
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ment was a high or low ranked participant was 
unknown in advance to the Rorschach adminis- 
trator. One girl was presented to him after each 
discussion in randomized order except that the 
restriction was imposed that 10 would be of highest 
attained leadership status and 10 would be of 
lowest attained leadership status. (Hereafter, these 
girls will be referred to as leaders and nonleaders.) 

The scoring of the Rorschach test followed the 
procedure outlined by Beck (7). All of the scoring 
was checked by a Rorschach interpreter to correct 
for personal bias. There was a rank-order correla- 
tion of .gt between the administrator's and inter- 
preter’s estimates of the over-all adjustment of the 
20 examinees. 

Since further understanding of the Rorschach was 
not the intent of this study, detailed statistical 
analysis of the specific subscores obtained from 
scoring the Rorschach protocols was not performed. 
Rather the Rorschach interpreter,? who never inter- 
viewed Ss personally, examined all 20 protocols 
and the scores based on the protecols. He then 
wrote a brief personality sketch on the basis of this 
examination and estimated whether the protocol 
was that of a leader or nonleader. 


Results 


The 1o high ranked LGD participants or 
leaders made an average of 39.1 responses to 
the Rorschach cards while the low ranked 
LGD participants or nonleaders made an 
average of 25.4 responses. The difference of 
13.7 responses was significant at the .or level 
of confidence. This finding is consistent with 
several previous studies which found that 
high verbal output was characteristic of LGD 
leaders as compared with nonleaders (3, 4). 
It is also consistent with the analysis reported 
in the next section which will show that 
LGD leaders tend to make many more re- 
sponses than do LGD nonleaders during an 
hour personal or individual interview where 
they were required to talk mainly about 
themselves. All these results tend to suggest 
the existence of a verbal productivity factor 
which permeates the LGD, the Rorschach, 
and the individual interview. This verbal 
productivity factor, in turn, appears to be a 
characteristic which differentiates leaders 
from nonleaders according to Stogdill’s sur- 
vey of personal traits associated with leader- 
ship status (22). This appears especially true 
about conference leaders (21). 

The Rorschach expert guessed correctly the 
LGD leadership status of 16 of the 20 girls 
by examination of the Rorschach protocols 
alone. Two nonleaders were falsely identi- 
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fied as leaders and 2 leaders were falsely 
identified as nonleaders. A phi correlation 
coeficient of .60 was obtained between 
guessed and actual leadership status. Accord- 
ing to chi-square analysis, this relationship 
was significant at the .or level of confidence. 

A summary of the traits ascribed by the 
Rorschach expert to 2 or more of the 8 girls 
correctly identified as LGD leaders and 2 
or more of the 8 girls correctly identified as 
nonleaders proved revealing. Two or more 
of the 8 leaders were described as: strongly 
imaginative or having a good imagination, 
7 (were so described), highly energetic, 5; 
highly intelligent, 4; emotionally stable or 
having a very healthy, normal reaction to 
stress, 3; strongly interested in details, 3; 
opinionated, 3; hard or tough-minded, 2; 
self-centered, 2; highly ambitious, 2; highly 
verbal, 2; and able to see the larger aspects 
of things, 2. Two or more of the 8 correctly 
identified LGD nonleaders were described 
as: stereotyped or conventional in thoughts 
and perceptions, 4; immature, 4; inadequate 
emotional expression, 3; unintelligent or dull, 
3; unclear, plodding, confused thinker, 3; 
lazy, passive or unenergetic, 3; unsociable, 3; 
impulsive, 2; unimaginative, 2; pessimistic, 
2; and insecure, 2. 

Of the 4 misidentified girls, one was a clear 
misjudgment. The second girl who was 
identified mistakenly as a leader, at the same 
time, was considered by the Rorschach 
worker to be artificial, insecure, pessimistic, 
preoccupied with health, lacking in social 
feeling, with pretentions towards learning 
and art. The third, who was identified mis- 
takenly as a leader and who had personal 
characteristics ascribed to her which are usu- 
ally associated with leadership status, actually 
was very high in leadership status within her 
own sorority according to peer ratings (15) 
although she performed poorly on the LGD. 
The fourth girl, a leader on the LGD identi- 
fied wrongly as a nonleader, was described as 
having a feeling of personal insecurity which 
required great need for personal attention 
from others. In this last case, the suggestion 
is that although personality inadequacies 
were present similar to those of other non- 
leaders, some inadequacies present actually 
created for the girl a strong drive toward 
leader status. 
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Despite the crude and rough nature of the 
analyses and the small size of the sample, the 
different pattern of personality characteristics 
ascribed to LGD leaders compared with non- 
leaders suggest to the investigators that per- 
sonality traits commonly ascribed as differ- 
entiating leaders from nonleaders in a wide 
variety of situations differentiate LGD lead- 
ers and nonleaders. 


Persona INTERview: AtrirupE Towarp 
ONESELF AND Orners RELATED TO 
LGD Benavior 


Procedure 


While the highest ranked participant of 10 of 
the leaderless group discussions was administered 
the Rorschach test, the lowest ranked girl was 
subjected to an hour personal interview. When 
the lowest participants of 10 of the other discussions 
were given the Rorschach test, the highest ranked 
members were interviewed. Because of an admin- 
istrative error, it was possible to actually make use 
of the interviews of only 9 leaders and 9 nonleadcrs. 
Once again, the interviewer was not informed of 
whether his interviewee was an LGD leader or 
nonleader. 

The purpose of the interview was to allow the 
leaders or nonleaders to display their attitudes 
toward themselves and their relations with others. 
Except for the first few minutes of the interview, 
a nondirective approach was utilized in order to 
help the interviewee to reveal readily the way she 
regarded herself. The interviewer's job was one 
of “sensitive reflecting and keeping threat at a 
minimum.” 

Although the organization of the interview hour 
was largely the responsibility of S, at an early 
stage of the interview, attention of S was focused 
upon herself in a directive manner. Two standard 
interview “openers” were used. They were as 
follows: 

1. “What was there about the leaderless group 
discussion that you liked?” This question per- 
mitted Ss to speak about the favorable and un- 
favorable aspects of the situation that they had just 
experienced. It afforded the girls the opportunity 
to express their feelings and attitudes toward the 
group discussion question, the group discussion 
observers, the other participants. their effect on the 
group, and in many cases what they thought their 
performance was. 

2. The “W-A-Y (who are you?)” procedure of 
Bugental and Zelen (8) followed the initial ques- 
tion. It involved asking the subject to write 3 
answers to the one question, “Who are you?” in 
any manner that she liked. After completing this, 
an inquiry took place along lines suggested by 
Bugental and Zelen (8): 

a. What do you feel is the essential word in this first 
answer? And now in the second? . . . And the third? 

4. Suppose, just for the moment, that I had the power 
to change any one of these three answers so that it 
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would no longer be true of you in any way. Which 
one would you let me change first? 

¢. Now, which one would you hold on to the last, 
which one would you not let me change? 

d. When you consider most of the people whom you 
know, which one of these answers would they be apt to 
think of first in relation to you? 

e. And which one would they think of last in relation 
to you? 

j. Which one do you like best? 

g- And which one do you like the least? . 

Ah. Which one would you think most people would 
like best? 

i. And which one would you think most people would 
like least? 


After the two standard interview “openers” it was 
relatively easy for most of the girls to start talking 
about themselves and the things that interested 
them. Some took their lead from the previous 
experience (“Who are you?”) and started talking 
about their personal affairs, their relations at home, 
the problems that they are feeling, their attitudes 
toward school, sorority, etc. Others went back to 
the leaderless group discussion and discussed its 
effect on them, the contribution they and others 
made and what they thought of it. Eventually, they 
too came to talk about themselves. 

Each interview was tape recorded. 


Analysis of Data 


Upon listening to the recording of the interview 
a second time, a rater divided each interviewee’s 
responses into response units such as “Their atti- 
tude hinders me” or “I was worried at first whether 
we were going to get anywhere.” Each unit was 
an expression of a complete thought. 

Each unit was placed in one of four categories, 
modified after Bugental and Zelen (8), depending 
on the subject of the response as follows: 

A. Responses concerning oneself and the effect 
of one’s behavior and characteristics on oneself. 
Examples: “I am always prompt” or “My temper 
worries me.” 

B. Responses concerning the effects of one’s be- 
havior and characteristics on others. Example: 
“I sold them the idea readily.” 

C. Responses concerning the effects of other per- 
sons’ behavior and characteristics on oneself. 
Example: “He offended me.” 

D. Responses concerning others’ behavior and 
characteristics and the relations of other persons 
with each other. Examples: “John gave her news.” 
“Henry is lazy.” 

For each response, an inference was made con- 
cerning the attitude of the interviewee toward the 
subject of the response, i.c., whether it was favor- 
able, unfavorable, or neutral. Responses were thus 
classified into 12 possible types: favorable responses 
about oneself, unfavorable responses about oneself, 
neutral responses about oneself, and so forth. 

The total number of responses was determined 
of each of the 12 types made by each of the 
interviewees. 

For a five-minute sample of each recording, the 
rater’s categorizations were checked against the 
categorizations of a second rater. The average 
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correlation * between the raters was .85 suggesting 
that the coding of the responses according to the 
procedure described above could be done with 
satisfactory reliability. 


Results 


Table 1 displays the total number of re- 
sponses of each of the 12 possible types yielded 
by the 9 LGD leaders and the g LGD non- 
leaders. The table also shows the ratio of the 
leaders’ to the nonleaders’ frequencies of re- 
sponses of each type. Altogether, leaders 
made one and a half times as many responses 
as nonleaders. This difference was signifi- 
cant at the .or level. Once again, the sare 
verbal productivity factor appeared to be op- 
erating as had been found for the LGD, the 
Rorschach test and in previous studies. 

A series of chi-square analyses was per- 
formed on the data shown in Table 1. The 
major purpose of these analyses was to test 
whether or not LGD leaders, as a group, dis- 
played a different frequency pattern of re- 
sponses from that of LGD nonleaders. In 
order to test the null hypotheses of these analy- 
ses, it was necessary to assume that no sys- 
tematic or biasing variations in frequencies 
existed among the g leaders or among the 9 
nonleaders, for the extent to which random- 
ness prevailed for the total frequencies of each 
type of response for each of these two sub- 
groups, as groups, depended on the random- 
ness of variation within the two subgroups, 
since the sums of randomly distributed scores 
are randomly distributed. 

The 3 first-order interactions among each 
of the possible pairs of modes of classifica- 
tion were significant beyond the .o1 level of 
confidence. 

It was inferred from these tests of signifi- 
cance that whether or not a response was 
favorable, unfavorable, or neutral in attitude, 
depended upon whether the respondent was 
a leader or nonleader as well as whether she 
was discussing herself, her relations with 
others, or the relations among other persons. 

Inspection of the ratios shown in Table 1 
revealed the major points of interaction of 
interest to this article. They can be sum- 
marized as follows: 

1. Whereas LGD leaders made 1.5 times 
as many responses altogether as nonleaders, 

8 These correlations were obtained by correlating the 


total frequencies with which each rater assigned each 
interviewee’s responses into each of the 12 categories. 
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they made 1.9 times as many favorable, and 
only 1.4 times as many unfavorable responses. 
Similarly, leaders made 1.6 or 1.7 times as 
many responses concerning themselves, their 
effects on others and vice versa while they 
made only 1.3 times as many responses about 
the relations among other people. On the 
whole, therefore, LGD leaders tended to a 
greater degree than nonleaders to discuss 
themselves and their relations with others 
and to exhibit, in general, more favorable 
attitudes towards the various subjects of their 
responses. 

2. LGD leaders expressed 2.5 times as many 
favorable responses about themselves as did 
nonleaders although their total responses con- 
cerning themselves was only 1.7 times that 
of nonleaders and their total favorable re- 
sponses towards all subjects was only 1.9 
times that of nonleaders. Similarly, leaders 
showed 2.1 times as many favorable responses 
concerning their effects on others and twice 
as many favorable responses concerning other 
persons’ effects on them, the leaders, although 
according to the corresponding ratios of all 
favorable comments (1.9 to 1) and all re- 
marks about relations of the self with others 
(1.6 and 1.7 to 1) the expected ratios for these 
two cells were about 1.7 or 1.8 to 1 respec- 
tively. A lower than expected ratio of 1.2 
to 1 was found when comparing leaders’ 
with nonleaders’ frequencies of expressions 
of favorable attitudes towards the relations 
among others. 

On the other hand, nonleaders made al- 
most as many unfavorable remarks about 
themselves as did leaders. The ratio of 1.1 to 
1 was much lower than expected. Likewise, 
the ratio of 1.3 to 1 obtained when comparing 
the frequency with which leaders and non- 
leaders expended unfavorable comments 
about their effects on others was less than 
expected according to the corresponding ratio 
of all unfavorable comments expressed (1.4 
to 1) and the ratio of all responses about one’s 
effects on others (1.6 to 1). The ratio of the 
remaining cells did not deviate markedly 
from expectation. 

These results suggested that LGD leaders 
far more frequently regard themselves, their 
effects on others, and other persons’ effects 
on them, in a favorable light, while non- 
leaders tend to look disparagingly on them- 
selves and their effects on others. The 
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investigators felt that these results were con- 
sistent with expectations. 


Inventoriep Traits Retatep to LGD 
BEHAvior 
Procedure 

It was possible to administer the first 5 subtests 
of the Guilford-Zimmerman Temperament Survey 
to 76 of the 140 sorority girls and all 10 subtests to 
49 of the 76. Because few girls could be given both 
the individual interview and the G-Z Temperament 
Survey and because girls with extreme scores on 
the LGD were administered the interview, it was 
expected that the samples of 76 and 49 girls given 
the G-Z Temperament Survey would be more 
homogeneous in LGD scores than the total sample 
of 140. 

This personality inventory was selected because 
there was some evidence available concerning its 
factorial and external validity. In addition, it was 
developed and standardized on a college population. 
Each of the subtests appeared to have adequate 
reliability although there was doubt about the sub- 
tests’ factorial purity or their independence from 
each other (16). 


Results 

Table 2 shows the Pearson product-mo- 
ment correlations between each of the sub- 
tests of the G-Z Temperament Survey and 
leadership status attained during initially lead- 


erless discussion. As expected, the samples 
of 76 and 49 girls who were administered the 
survey were restricted significantly in range 
of LGD scores. The standard deviation of 
LGD scores for the successively smaller se- 
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lected samples of 140, 76, and 49 girls were 
11.7, 9.4, and 8.3 respectively. Column 2 of 
Table 2 lists the correlations uncorrected for 
this restriction in range and Column 3 lists 
the estimates of what these correlations 
would have been had G-Z Temperament data 
been available for all 140 girls. 

There was a significant correlation between 
leadership status attained on the LGD and 
self-ratings on factor A (Ascendance and So- 
cial Boldness versus Submissiveness). The 
correlation between LGD scores and A was 
.37 uncorrected and .44 corrected for restric- 
tion in range. At 74 df, a correlation of .28 
is significant at the .o1 level of confidence. 

The correlation between factor S, Sociabil- 
ity, and LGD score was .22 uncorrected and 
.27 corrected for restriction in range. A cor- 
relation of .22 is significant at the .05 level at 
74 aj. From these results, it was inferred that 
the two traits A and S, both commonly found 
associated with leadership status in a wide 
variety of situations, were significantly re- 
lated to performance on the LGD. The 
higher degree of social boldness and social 
interest among those higher in LGD score 
appeared consistent with the results of the 
personal interview analysis which indicated 
that LGD leaders were more likely to be 
satisfied with their relations with others. 

The remairder of the correlations between, 
the G-Z factors and LGD behavior were sug-’ 


TABLE 2 


CORRELATIONS BETWEEN SCORES ON THE 10-Factor PeRsoNALITy SCALES oF THE GuILFoRD-ZIMMERMAN 


TEMPERAMENT SURVEY AND THE Extent To WuicH 


LeapersHip Status Was 


Arratnep Durineo Aw InitTiALLy Leapertess Group Discussion 








PERSONALITY Factor 


r UNCORRECTED r CORRECTED 
FoR RESTRICTION FoR RESTRICTION 
IN RANGE IN RANGE 


(2) (3) 





G General Activity & Energy vs. Inactivity, Slowness 
R Restraint, Seriousness vs. Impulsiveness 
A Ascendance & Social Boldness vs. Submissiveness 
S Social Interest & Sociability vs. Shyness, Seclusiveness 
E Emotional Stability vs. Emotional Instability & De- 
pression 
Objectivity vs. Subjectivity & Hypersensitiveness 
Friendliness & Agrecableness vs. Hostility & Bel- 
ligerence 
Thoughtfulness & Reflectiveness vs. Thoughtlessness 
& Extraversion 
Personal Relations & Cooperativeness vs. Critical- 
ness & Intolerance 
Femininity vs. Masculinity 


033 -I5 
-16 20 


+37 -44 
.22 27 


+16 .20 
-04 


.16 


+13 


-10 
02 





@f=47, r=.37 at .o1 level and .29 at .05 ‘evel of confidence. 
df=74, r=.28 at .o1 level and .22 at .05 level of confidence. 
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gestive, but not significantly different from 
zero. LGD leadership status appeared 
slightly positively associated (if at all) with 
G (General Activity and Energy); R (Re- 
straint and Seriousness); E (Emotional Sta- 
bility); T (Thoughtfulness and Reflective- 
ness) and P (Personal Relations and Cooper- 
ation). It was also possible that a slight 
negative relationship existed between LGD 
status attained and F (Friendliness vs. Bel- 
ligerence). 

These results, in general, tended to be con- 
sistent with the findings of the previous sec- 
tion in which analyses of LGD leaders were 
more likely to regard themselves favorably, 
while LGD nonleaders tended to comment 
about themselves unfavorably. 

The correlation between G-Z Tempera- 
ment Survey variables and LGD status at- 
tained in general tended to support indirectly 
the inference that LGD behavior could be 
used as an assessment of leadership potential, 
since most of the factors which were cor- 
related with LGD status attained also appear 
to be correlated with leadership status in a 
wide variety of situations. 

To the investigators, there appears to be 


a fairly consistent correspondence of out- 
comes from assessments based on the LGD, 
the Rorschach, the G-Z Temperament Sur- 
vey, and the attitude and behavior exhibited 
during personal interviews. 


AUTHORITARIAN ATTITUDES AND LGD 
BEHAVIOR 


Procedure 


There were several reasons for studying the rela- 
tion between the F scale and LGD behavior. 
Carter, Haythorn, et al. (11) had found that par- 
ticipants of initially leaderless group discussions 
must display authoritarian behavior in order to 
attain high leadership status. In addition, it was 
suggested by others to the investigators that the 
LGD was a means of selecting “authoritarian per- 
sonalities” rather than those of high leadership 
potential, although on a pilot sample of 15 ele- 
mentary psychology students, a correlation of —.58 
had been found by one of the investigators between 
the F scale and LGD scores. Also, LGD nonleaders 
appeared to interviewers to be more stereotyped, 
rigid, and prejudiced than did nonleaders. For 
these reasons, it was thought pertinent to examine 
the correlation between leadership status attained 
during initially leaderless discussions and scores on 
the F scale, since this scale appears valid as a 
measure of conventionalism, submission to authority, 
authoritarian aggression, stereotypy, power, and 
“toughness,” as well as such personality factors as 
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rigidity, lack of tolerance for ambiguity, and so 
forth (1). Accordingly, the F scale (Forms 40 and 
45) was administered to a sample of 284 ROTC 
seniors. LGD ratings obtained a year earlier were 
available for 172 of these. 


Results 


The mean obtained F score of 3.83 and the 
standard deviation of .75 indicated that the 
sample, although from the deep South, was 
not much different in mean score than the 
California samples on whom the F scale had 
been standardized. It was, however, some- 
what restricted in range. This may have 
served to attenuate the correlations reported 
below. 

Inspection of the scatterplot of the relation 
between the F and LGD scores revealed the 
existence of a curvilinear relationship. While 
a Pearson product-moment correlation of 
—.16 was found when a linear regression 
was assumed, an eta of .30 was obtained 
when the curvilinear nature of the regression 
was considered.* The chi-square test for line- 
arity cf regression disclosed that the eta was 
significantly different from the Pearson cor- 
relation at the .o1 level of confidence. 

The negative Pearson correlation of —.16, 
significant at the .05 level, suggested to the 
investigators that there existed a slight, gen- 
eral tendency for those who score higher on 
the LGD to be more resourceful, flexible, 
and self-reliant and less stereotyped, submis- 
sive, power-oriented, and rigid individuals 
than those who scored lower on the LGD. 

Inspection of the scatterplot indicated that 
those extremely authoritarian in personality 
according to their F scale scores did very 
poorly as a group on the LGD. Maximum 
success on the LGD occurred for those par- 
ticipants who tended to be permissive—but 
not too permissive. Those extremely tolerant 
in attitude tended to score somewhat lower 
on the LGD, although not nearly as low as 
those who were extremely rigid and inflexible 
as judged by the results of the administration 
of the F scale. 

This curvilinear relation may be a function 
of the sample and the situation. It is con- 
ceivable that for ROTC personnel, in the 
South especially, extremely permissive per- 
sons are less likely to attain leadership status 

* When the same Ss were retested on the LGD a year 
later, the corresponding Pearson correlation was —.09 


and the eta was again .30 between the F scores and 
LGD scores on the retest. 
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than would be the case for samples drawn 
from populations more liberal in attitude. 

It is difficult to state specifically how the 
results of this analysis can be connected with 
recent evidence which has been accumulating 
in a number of studies which suggests that 
successful business, industrial, and military 
supervisors tend to be permissive in their 
supervisory methods, interested in being of 
service to others, and considerate of the wel- 
fare of their subordinates. However, the in- 
vestigators believe that the analysis of the F 
scores of LGD participants is consistent with 
such evidence. 


SUMMARY 


The purpose of this study was to demon- 
strate the extent of the consistency of LGD 
personality assessment with various other per- 
sonality assessment procedures, specifically: 
the Rorschach test, analyses of interview pro- 
tocols, the Guilford-Zimmerman Tempera- 
ment Survey, and the UCPOC F scale. 

A Rorschach analyst blindly judged cor- 
rectly the leadership status of 16 to 20 LGD 
leaders and nonleaders. The kinds of per- 
sonality traits ascribed to the LGD leaders 
by the Rorschach interpreter were in line 
with those commonly found among leaders 
in a wide variety of situations. 

It was inferred from analysis of protocols 
based on generally “nondirective” interviews 
that LGD leaders more often displayed favor- 
able attitudes towards themselves and their 
relations with others while nonleaders more 
often spoke unfavorably about themselves 
and their effects on others. 

Leadership scores based on status attained 
during the LGD correlated .37 with Ascend- 
ancy scores on the Guilford-Zimmerman 
Temperament Survey; they correlated .22 
with Sociability scores on the Survey, while 
they correlated curvilinearly with the F scale, 
an assessment of the tendencies towards stere- 
otypy, moral conventionalism, submission to 
authority, and rigidity. The results as a 
whole suggest to the investigators that status 
attained during the LGD is, as McClelland 
might say, “a psychological measurement 
with explicit operational meanings whose 
operations to some extent cut across and are 
confirmed in different areas of behavior.” 
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PERSEVERATIVE BEHAVIOR IN A CONCEPT-FORMATION TASK 
AS A FUNCTION OF MANIFEST ANXIETY AND RIGIDITY * 


ELIZABETH L. WESLEY 2 
State University of lowa 


IcipiTy is a dimension of behavior that 
R has been widely referred to in descrip- 
tions of both normal and abnormal 
individuals. Although individual investi- 
gators have tended to define this concept in 
somewhat different ways, it is felt that a 
single, broad definition may be extracted for 
the purpose of this study. Therefore, rigid- 
ity is conceived of as a tendency to persist in 
responses that may previously have been 
suitable in some situation or other, but that 
no longer appear adequate to achieve current 
goals or to solve current problems. 
Research into the nature of rigidity has 
developed from at least two different direc- 
tions. By the administration of batteries of 
sensorimotor tasks to normal subjects (Ss), 
some investigators (2, 3, 9, 17) have isolated 
a factor that has been labelled “ability to shift 
set,” or “disposition rigidity.” These investi- 
gators have usually included a somewhat 
limited range of tasks performed in a single 
standard testing situation, thus restricting 
the generality of the conclusions drawn. 
Luchins (10, 11, 12), who has investigated the 
rigidity of approach of normal Ss in problem- 
solving situations, has pointed out the many 
temporary factors influencing rigidity and 
raises serious doubts concerning the general 
consistency of this behavior variable. Ro- 
keach (15), on the other hand, has reported 
a relationship between rigidity in problem- 
solving situations and tendencies toward 
ethnocentric behavior, indicating the possi- 
bility of a consistent influence of this variable 
in two quite different behavioral situations. 
Other investigators have studied various 
clinical groups with respect to their ability to 
shift set in tasks of concept formation. Gold- 
stein and Scheerer (5) describe rigidity as 
1 This article is based on a dissertation submitted to 
the faculty of the Department of Psychology of the State 
University of Iowa in partial fulfillment of the require- 
ments for the degree of Doctor of Philosophy. The 
author wishes to express her indebtedness to Drs. I. E. 
Farber and Arthur Benton for their guidance throughout 
the course of this investigation. 


2 Now at the Louisville Mental Hygiene Clinic. 


one aspect of the concrete behavior they found 
to be characteristic of patients with cerebral 
pathology. Hanfmann and Kasanin (8), 
among others, have studied concept forma- 
tion in schizophrenics and report results 
similar to those found by Goldstein and 
Scheerer for organics. More recently Fey (4) 
has applied a similar task, the Wisconsin 
Card-Sorting Test, to schizophrenics and 
reports more perseveration in the schizo 
phrenic than in the normal group. 

Thus, there seems to be evidence that rigid- 
ity may be a general characteristic of behavior 
and that it may be related in some fashion to 
other important personality variables. The 
evidence, however, is more presumptive than 
real. Therefore, a next step in an understand- 
ing of this problem would be to attempt to 
measure rigidity independently and then 
study the relationship of this variable to be- 
havior in clinical situations that presumably 
measure rigidity. If this could be done, it 
would serve a twofold purpose. First, it 
would be a step in the direction of establish- 
ing the consistency of rigidity as a behavior 
variable. Second, it would serve to define 
more precisely one of the variables presum- 
ably influencing behavior in the clinical 
situation. 

In order to ascribe variations in perform- 
ance in a given clinical situation to a given 
personality characteristic, it is not sufficient 
merely to show that there is a relationship 
between them; it is necessary also to show 
that the kind of behavior involved could not 
reasonably be attributed to other kinds of 
behavior tendencies. Thus, schizophrenic 
patients may show poor concept formation 
and it may be correct to ascribe this to rigid- 
ity. However, since these patients may pos- 
sess other characteristics in common, such as 
low drive level, some of these characteristics 
might also be at least partially responsible for 
poor concept formation. It would be desir- 
able, therefore, to determine the possible 
effect on performance in the concept-forma- 
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tion task of other personality characteristics 
that might reasonably be expected to influence 
such performance. 

Among these personality characteristics is 
that of manifest anxiety. Taylor (19) has 
defined manifest anxiety in terms of scores 
on a paper-and-pencil questionnaire involv- 
ing instances of behavior judged by expert 
clinicians to be indicative of overt anxiety. 
Taylor (19) and Spence and Taylor (18) 
have found that Ss scoring high on this scale 
show a significantly greater amount of eyelid 
conditioning than Ss scoring low on this scale. 
These results were predicted on the suppo- 
sition that manifest anxiety involves a state 
of heightened drive and therefore affects be- 
havior in the manner of a drive state, as indi- 
cated by Hull's behavior theory.* Other 
investigators (14, 16) have also demonstrated 
that Ss scoring high on the Taylor Anxiety 
Scale behave in a manner implying a height- 
ened drive state when given tasks involv- 
ing stimulus generalization and rote serial 
learning. 


PRoBLEM 


The purpose of this investigation was to 
study the influence of rigidity, defined by 
scores on a questionnaire, on behavior in a 
concept-formation task. It was predicted that 
Ss considered to be highly rigid in terms of 
this independent measure would take longer 
to learn to shift their set in the concept-for- 
mation task than would the subjects low in 
rigidity. 

A second purpose of this study was to 
determine the relationship between manifest 
anxiety and behavior on a concept-formation 
task. This was undertaken as a means of 
adding to our knowledge of the factors influ- 
encing concept formation as well as permit- 
ting control of a relevant factor that might 
otherwise interfere with the understanding 
of the influence of rigidity as such. How- 
ever, the probable direction of the relation- 
ship, if any, could not be specified. A recent 

8 Hilgard, Jones, and Kaplan’s(7) results raise an 
interesting problem in this regard. They found that in 
differential conditioning anxious Ss did not give more 
conditioned responses to positive stimuli than did Ss low 
in anxiety. However, the anxious Ss did give more con- 
ditioned responses to negative stimuli. Although, on the 
hypothesis that manifest anxiety reflects drive strength, 
we would not expect the former result (except under 


special conditions, ¢.g., unusually low maximum per- 
formance), the latter result is consistent wih expectation. 


study (14) has indicated that the effect of 
anxiety upon performance depends upon the 
difficulty of the task involved—or more pre- 
cisely, upon the initial relative strengths of 
correct and incorrect response tendencies in 
the situations. In the absence of highly 
detailed information with respect to this 
aspect of a given concept-formation task, it 
was not possible to predict the direction of 
any effects of anxiety that might occur. 


EXPERIMENTAL PRoceDURE 


Personality Measures 


Ninety items thought to be indicative of rigidity 
were selected by the experimenter from a number of 
paper-and-pencil personality tests and some original 
items were constructed. These items were then sub- 
mitted to five clinical psychologists who were in- 
structed to rate each one in terms of the degree of 
rigidity demonstrated in the statement: high, mod- 
erate, or very small. Fifty items rated by all five 
judges as indicating a high degree of rigidity were 
selected, of which the following are examples: 

There is usually one best way to solve most problems. 

I prefer work that requires a great deal of attention to 
detail. 

I find it easy to stick to a certain schedule once | 
have started on it. 

These items were combined with 17 filler items 
selected from the Minnesota Multiphasic L, F, and 
Pt scales to make up the Rigidity Scale. Only the 
rigid items contributed to the rigidity score, how- 
ever. A revision (see 16) of the Taylor Anxiety 
Scale was used as the measure of manifest anxiety. 


Subjects 


Three groups of Ss were selected from under- 
graduate psychology classes on the basis of their 
scores on the Rigidity Scale, the Revised Anxiety 
Scale, and the L Scale of the Minnesota Multiphasic 
Personality Inventory. 


1. Rigid Group. 21 Ss scoring in the upper decile 
(score 28) on the Rigidity Scale and below the 6oth 
percentile (score 18) on the Anxiety Scale. 

2. Anxious Group. 21 Ss scoring in the upper decile 
on the Anxiety Scale (score 29) and below the median 
(score 23) on the Rigidity Scale. 

3. “Normal” Group. 30 Ss scoring below the median 
on both the Rigidity (score 23) and Anxiety (score 16) 
Scales. 


All Ss used received a T score of 50 or less on the 
L Scale of the Minnesota Multiphasic Personality 
Inventory. 


Apparatus and Materials 


The experiment was conducted in a well-lighted, 
relatively quiet room. S was seated in front of a 
black screen which rested on a table. A box con- 
taining four card-sorting compartments, with sides 
and backs of hardware cloth, was placed in a rec- 
tangular opening in the screen. The cards to be 
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sorted were similar to those of the Wisconsin Card- 
Sorting Test described by Grant and Berg (1, 6) 
with the exception of a colored border that appeared 
on some of the cards. The cards varied with respect 
to all possible combinations of color, number of 
figures, border, and form of the figures they con- 
tained; no two cards were exactly alike. They were 
sorted into four compartments on the basis of cue 
cards placed above each compartment. The four 
cue cards were respectively: one blue triangle, two 
red circles, three yellow squares, and four green 
crosses (all without borders). The cue cards were 
not repeated among the cards to be sorted, leaving 
124 cards in the pack. If S placed the card in the 
correct compartment, a green jewel light mounted 
on the screen above the compartments to the left of 
S was immediately flashed on by the experimenter. 
A red jewel light mounted to the right of the green 
light was flashed if S placed the card in the wrong 
compartment. 


Procedure 


The instructions indicated that the Ss were to 
learn to sort the cards into the correct compartment 
using the cue cards as their guide. The S sorted 
the cards until he achieved ten consecutive correct 
sortings (each sorting was considered as one ¢ria/) 
in the first series. A series was defined as any group 
of trials during which one particular cue always 
comprised the required basis for sorting. After 
making ten correct sortings on the basis of the first 
cue, ¢.g., number, a different cue, ¢e.g., form, was 
arbitrarily and without warning to S made “cor- 
rect,” requiring a shift in S’s conceptual set. When 
ten correct sortings had been made on the basis of 
this cue, the third remaining simple cue, e.g., color, 
was made “correct” in the same fashion, requiring 
a third shift in set by the subject. The basis (cue) 
for sorting was assigned to the four series in a 
modified counterbalanced order. 

The S was not permitted to correct his sorting 
once the card had been placed into a compartment 
and the signal indicating that the response was right 
or wrong had been given. The Ss who sorted 124 
cards without reaching a criterion in the first series 
or who sorted 372 cards without reaching the fourth 
series were discarded. Two Anxious and two Nor- 
mal subjects were discarded by this criterion. 


RESULTS 


Relation Between the Measures of Rigidity 
and Manifest Anxiety 


Before determining the relationship be- 
tween rigidity, manifest anxiety, and concept 
perseveration, it appeared desirable to investi- 
gate the relationship between the paper-and- 
pencil inventories used to define these general 
personality characteristics. The correlation 
(Pearson’s r) between the scores on the 
Anxiety Scale and those on the Rigidity Scale 
(269 pairs of scores) was +-.10 and not sta- 


tistically significant. Although an inspection 
of the plotted scores indicated the possibility 
of the existence of a curvilinear relationship, 
the eta of +.23 was also not significant. 
Thus, there is no basis for assuming a rela- 
tionship between scores on these two scales. 


Performance on the Concept-Formation Task 


1. Number of trials to criterion. The 
means, standard deviations, and medians of 
the number of trials taken by the Rigid, 
Anxious, and Normal Groups to reach the 
criterial trials for each series are presented in 
Table 1.* It may be seen from this table that 
the median scores for the Rigid and Anxious 
Groups were identical on the first series, 


TABLE | 
Means, STANDARD DeviATIONS, AND MEDIANS OF 
NumsBer oF Triats To CriTeRIon ReEQuireD 
BY THE Ruicip, ANxious, AND NorMAL 
Groups on Four Series oF TRIALS 








SERIES 
Gaeup I TT 





M ° +19 «14-33 
Rigid SD . -90 14.41 
(N=21) Mdn ‘ .00 7.00 
M ‘ -14 8.90 
Anxious SD ‘ -95 11.02 
(N=21) Mdn ‘ -00 3.00 
M ° -230—«11..53 
Normal SD , (jo) 88.93 
(N=30) Mdn 6. .00 6.50 





being lower than that of the Normal Group. 
Following this series, however, the median 
scores for the Rigid Group were in each 
instance higher than those for the other two 
groups, while the Anxious Group tended to 
show the lowest median number of trials to 
the criterion. 

These differences between the results on 
Series I and the remaining series were by no 
means surprising since the scores for Series 
I had a different meaning from those based 
on Series II, Ill, and IV. Series I may be 
thought of as the original learning part of the 
concept-formation task, whereas scores on 

*It may be seen from the results presented in Table 1 
that the medians were in each instance lower than the 
means and that the standard deviations were very large 
relative to the means, indicating that the distributions of 
scores were all positively skewed. Because of the ex- 
treme skewness of the distributions, it was felt that the 
medians were more representative of the entire distribu- 


tions than the means since they would be less affected 
by extreme scores. 
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Series II, III, and IV reflected at least two 
factors—ability to learn the card-sorting task 
plus ability to shift from a previously estab- 
lished conceptual set. The difference between 
these two factors might be considered to be 
similar to the difference between facility in 
acquiring a response and speed of extinction 
of a response. 

A U test® was applied to the scores on 
Series I of the Rigid, Anxious, and Normal 
Groups to detect possible differences in learn- 
ing ability on the card-sorting task. The 
probability values obtained were as follows: 
Anxious and Rigid, p<.80; Anxious and 
Normal, p<.06; Rigid and Normal, 
p<.o8. Although the Anxious and Rigid 


TABLE 2 
Means, Sranvarp DeviaTions, AND MEDIANS OF 
Correctep Scores ror THE Ricip, Anxious, 


AND Norma Groups on Series II, 
III, ano IV * 
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nificant for any single series or for the total 
series; (>) the difference between the total 
corrected scores fer the Rigid and Normal 
Groups appeared reliable (p=.04). Thus, 
the Rigid Group took significantly longer to 
shift its set, when all series were considered, 
than did the Normal Group. 

2. Perseveration scores. Since the present 
study was concerned with differences in 
amount of rigidity among the three experi- 
mental groups as defined in terms of per- 
severative, nonadjustive behavior, it was 
important not only to determine whether 
performance on the Rigidity Scale and the 
Anxiety Scale was related to number of 
trials taken to formulate a new conceptual 


TABLE 3 


Means, Stanparp Deviations, AND MEDIANS oF Per- 
SEVERATIVE RESPONSES FoR Ricip, ANXIOUS, 
AnD NorMac Groups On SERIES 
II, Il, ano IV 








Group II Ill IV Tora 





M 76.19 66. -76 223.24 
Rigid SD 37.40 20. -25 96.70 
(N=a1) Mdna 66.00 64. -00 200.00 
M 66.71 61. -14 «185.33 
SD 20.78 16. -16 48.36 
Mdn 66.00 62. .00 190.00 
M 69.40 63. 83 192.66 
SD 27.75 12. +22 39.04 
Mdn 62.50 61. -.00 184.00 


Anxious 
(N=a21) 


Normal 
(N=30) 





* Sixty points were added to all scores to climinate negative 
ues. 


Groups behaved more like each other than 
like the Normal Group, there is no sound 
basis for assuming that obtained differences 
between any of these distributions of scores 
may be attributed to other than chance 
factors. 

On the assumption that scores on Series 
I were a fair index of an individual’s learning 
ability, this score was subtracted from each 
individual’s score on the succeeding series, 
so that a clearer picture of the perseverative 
factor might be obtained. The medians for 
these corrected scores are presented in Table 
2. The U test between the corrected scores 
of the three groups yielded the following 
results: (a) the differences between the 
Anxious and Normal Groups were not sig- 

5Cf. Mann and Whitney (13). The evident skewness 
of the distributions and the lack of homogencity of the 


variances precluded the application of more conventional 
tests of significance such as F or ¢ to these scores. 


SERIES 


II Ill IV Torar 





00 35.48 
-14 29.00 
co =«.: 16.00 
-38 «613.67 
-12 9-55 
00 9.00 
17 19.93 
-90 17.03 
50 13.00 


8.52 
8.21 
6.00 


4-09 


M -95 
SD -70 
Mdn -00 
M +19 
SD ‘5S 3-73 
Mdn -00 3.00 
M -83 5.93 
SD -63 6.58 
Mdn -50 3.50 
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set, but also whether the responses on these 
trials were of a perseverative sort. For the 
purpose of this study, perseverative responses 
were defined as responses which would have 
been correct according to the cue for the 
previous series, but which were no longer 
correct in the series in which they occurred. 

In order to compare the Rigid, Anxious, 
and Normal Groups with respect to number 
of perseverative responses, the means, stand- 
ard deviations, and medians of these scores 
were computed for the three groups for 
Series II, II], and IV. These results are 
presented in Table 3. An inspection of the 
table reveals that the Rigid Group averaged 
consistently more perseverative responses than 
either the Anxious or Normal Groups. The 
Normal Group was next highest in number 
of perseverative responses and the Anxious 
Group made the least number of persever- 
ative responses, on the average. These differ- 
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ences were not statistically significant, on the 
basis of U tests, for Series II and III, but 
were significant for Series IV (R>A, p<.01; 
N>A, p=.02; R>N, p=.o1). In terms of 
the total number of perseverative responses 
over the three series, the Rigid Group showed 
significantly higher scores than either the 
Anxious (p=.02) or Normal (p=.05) 
Groups. The Anxious and Normal Groups, 
however, differed no more than might be 
expected by chance in terms of this measure. 


Discussion 

The results of this investigation have 
shown that rigidity as defined by a paper- 
and-pencil inventory is related to persevera- 
tion of response in a card-sorcing task of con- 
cept formation. The highly rigid Ss did 
not differ significantly from the less rigid 
Ss (Anxious and Normal Groups) in their 
original performances on the experimental 
task. However, when shifts in conceptual 
set were required, the high rigidity group 
took significantly longer to shift to the new 
sets. This relative difficulty in shifting to a 


new set seemed to result from difficulty in 
giving up the old set, since the rigid group 


made significantly more perseverative re- 
sponses than did the less rigid Ss. It might 
be said that the response sets of the highly 
rigid Ss showed greater resistance to extinc- 
tion than those of the nonrigid Ss. 

It is important to note the nature of the 
relationship established in this study. The Ss 
who had a tendency to perseverate in a rela- 
tively large number of everyday situations, 
or at least said that they did so, also tended 
to persist longer in their set in the experi- 
mental situation. In essence, a response- 
response relationship has been established 
which suggests some consistency between 
responses that occur in everyday situations 
and those that occur in a more rigorously 
controlled experimental situation. Some de- 
gree of generality in the tendency to main- 
tain a response or a response set in different 
situations is thereby indicated. 

The present study has not established any 
clear-cut relation between manifest anxiety 
and perseveration of response in a concept- 
formation task. Although the Anxious 
Group had consistently lower scores than 
the nonanxious groups in terms of number 
of trials taken to shift set and in terms of 
number of perseverative responses, these dif- 
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ferences were not statistically significant. Since 
the differences appeared to increase with suc- 
cessive shifts and since they were consistently 
in the same direction, it is possible that an in- 
creased rapidity in the formation of concepts 
in a task of this sort is related to a high level 
of drive. If this experiment were repeated 
with a larger number of Ss and with a greater 
number of series, a significant relationship 
between anxiety and performance on this task 
might be demonstrated. It is clear, however, 
that the present study has not done so. 

Since there is reason to suppose that rela- 
tionships between manifest anxiety and other 
variables are explicable in terms of motiva- 
tional factors (18, 19), it is possible that the 
differences between the Anxious and Rigid 
Ss in perseveration on this task were due to 
differences in drive level in the two groups, 
i.c., the Anxious Group may have been opera- 
ting on the basis of a higher drive state than 
the Rigid Group. If the higher drive state 
of the Anxious Group was the factor chiefly 
responsible for these differences in perform- 
ance, then the Anxious Group should also 
have differed significantly from the Normal 
Group, which presumably had a lower drive 
level. However, the differences between the 
Anxious and Normal Groups were not sig- 
nificant. 

Actually, there is no necessity for supposing 
that rigidity is associated with a relatively 
low drive state. Indeed, if rigidity repre- 
sents a mechanism of defense against the 
basic drive state of anxiety, then it may be 
symptomatic of a high level of drive, just as 
manifest anxiety is conceived to be symp- 
tomatic of an intense basic anxiety state. 
We may suppose, then, that the Anxious and 
Rigid Groups did not differ in terms of drive, 
but that the habits associated with the basic 
anxiety state may be different for the two 
groups. In other words, rigidity of behavior 
may reflect the presence of a general habit 
tendency rather than lowered motivation. 

Still another explanation of the differences 
in the resistance to extinction of response sets 
in the Rigid and Anxious Groups might be 
made in terms of differential rates of develop- 
ment of inhibition in the two groups. Since 
extinction is brought about, according to 
Hull, by the building up of inhibition, indi- 
vidual differences with respect to the develop- 
ment of inhibitory tendencies might be mani- 
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fested in terms of degree of behavioral rigidity. 
Further research is needed to test the theo- 
retical questions raised by these different in- 
terpretations of our results, 


SUMMARY 

The purpose of the present investigation 
was to study the relationship of the general 
personality characteristics rigidity and mani- 
fest anxiety as defined on the basis of paper- 
and-pencil questionnaires, to a task of the type 
used in clinical studies of concept formation. 
Three groups of Ss were selected on the basis 
of their scores on these scales: (a) Rigid 
Group—21 Ss scoring high on the Rigidity 
Scale and low on the Anxiety Scale; (5) 
Anxious Group—21 Ss scoring high on the 
Anxiety Scale and low on the Rigidity Scale; 
and (¢) “Normal” Group—3o Ss scoring low 
on both scales. The Ss were required to learn 
a card-sorting task similar to the Wisconsin 
Card-Sorting Test requiring the formation 
of concepts and the shifting of these concepts. 

The results were as follows: (a) No rela- 
tionship was found between manifest anxiety 
and rigidity as defined by paper-and-pencil 
questionnaires. (4) The Rigid, Anxious, and 
Normal Groups did not differ significantly 
in terms of number of trials to reach the 
criterion in Series I, original learning. (c) 
However, when individual differences in 
original learning were taken into account, 
the Rigid Group teok significantly longer to 
shift its set on subsequent series. In addition, 
they gave significantly more perseverative 
responses. (d) Although the Anxious Group 
showed consistently lower scores than the 
Normal Group in number of trials taken to 
shift set and in number of perseverative 
responses, these differences were not sta- 
tistically significant. 

These results were interpreted as indi- 
cating that Ss who had a tendency to per- 
severate in a relatively large number of every- 
day situations, or at least said that they did 
so, also tended to persist longer in their set 
in the experimental situation. Some degree 
of generality in the tendency to maintain a 
response or a response set in different situa- 
tions is thereby indicated. No clear-cut 
relation between manifest anxiety and per- 
severation of response in a concept-formation 
task was established. Differences between the 
effects of manifest anxiety and rigidity on 
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perseveration of response set in the card- 
sorting task might be attributed to differences 
in the habits associated with the basic drive 
state or to differential rates of development 
of inhibition in the two groups. 
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MENT: PLACEMENT OF ITEMS WITH INDIVIDUAL 
CHOICE OF NUMBER OF CATEGORIES * 


MUZAFER SHERIF 
University of Okiahoma 


XPERIMENTAL evidence has been accumu- 
kK lated in recent years to indicate that the 

individual’s judgment of stimuli is con- 
siderably affected by his attitude toward the 
stimuli being judged. Various kinds of am- 
biguous stimuli are used to demonstrate this 
tendency (2, 6). On the other hand, a major 
assumption in attitude measurement has been 
that placement of items on an issue along a 
scale is not affected by the attitudes of the 
judges used in constructing the scale (3, 7). 
The writers (4) have recently indicated that 
this apparent contradiction is resolved when 
individuals with sufficiently differing stands 
on the issue serve as judges. Under this con- 
dition, it was demonstrated that the place- 
ment of items is affected by the attitudes of 
the judges. Using the typical Thurstone type 
of attitude scaling (method of equal-appear- 
ing intervals), in which a specified number 
of categories is fixed by the experimenter and 
the scale values are determined by the fre- 
quencies with which items are placed in each 
category, characteristic displacements of judg- 
ments, depending on the pro or con attitudes 
of the judges, were found. “Neutral” items 
particularly were displaced toward the ex- 
tremes of the scale, the result being that items 
were piled up in certain of the prescribed 
categories to the detriment of others, with 
characteristic gaps in between. Various ex- 
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ance in tabulations; and Drs. I. L. Janis, H. H. Kelley, 
and F. D. Sheffield, for editorial suggestions during the 
preparation of the manuscript. The warm cooperation 
of Dean Laurence Snyder in facilitating administrative 
arrangements is also gratefully acknowledged. 


AND 


135 


CARL I. HOVLAND 
Yale University 


perimental checks indicated that this concen- 
tration of items by specified subjects (Ss) was 
not due to carelessness, but to the Ss’ personal 
involvement in the issue. 

The effect of attitude in the judgment of 
such items may perhaps be revealed even 
more clearly if the number of categories is 
not fixed by the experimenter (E), but the 
Ss are instructed to use whatever number 
seems necessary to differentiate the various 
positions on the issue. Investigation of this 
problem is of particular interest, for ulti- 
mately it may provide a means of utilizing 
the individual’s own categorization of state- 
ments as a behavioral index of his stand on 
an issue. Under conditions where the indi- 
vidual is free to choose the number of cate- 
gories used, one can investigate the extension 
or constriction of his scale through the num- 
ber of categories he considers necessary, and 
the distance between successive steps on his 
scale through his distribution of items to 
various categories. This method of individ- 
ual categorization may ultimately provide a 
concise basis for distinguishing between indi- 
viduals who only consider a narrow range of 
alternatives and regard everything as either 
“black or white” on an issue; and, at the 
other extreme, those who are willing to con- 
sider the whole gamut of alternatives on an 
issue and make fine distinctions within the 
total range. An individual’s tendency to 
adopt a constricted scale or an extended scale 
and his characteristic pattern for distributing 
items within the scale may indirectly reveal 
the individual's attitudes more clearly than 
the answers given to standard attitude scale 
items when one is conscious of the possible 
interpretation by others of each item he 
answers (cf. Campbell’s[1] discussion of 
“disguised-structured” methods of indirect 
attitude assessment). 

The specific problem to be investigated is 
the effect of the individual’s position on an 
issue on (a) the extension or constriction of 
the scale which he finds suitable for placing 
the various stimulus items and (4) the char- 
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acteristic distribution of items along the scale 
which he establishes. On the basis of theo- 
retical considerations derived from work on 
perception and judgment, it is reasonable to 
predict (a) that Ss with strong personal 
involvement in an extreme position will tend 
to establish a scale with a smaller number of 
categories than Ss not so strongly involved 
in the issue; and (4) that Ss with strong 
personal involvement in the issue will tend 
to be very discriminating (“choosy”) in 
accepting items at their own end of the scale, 
and correspondingly tend to lump statements 
together at the end of the scale which they 
reject. The tendency to be discriminating 


in accepting items at one’s own end of the 
scale may be described as a raised “threshold 
of acceptance,” and the latter as a lowered 
“threshold of rejection.” 


METHOD 


Procedure 


The procedure was planned to satisfy three gen- 
eral requirements: First, it was considered essential 
that Ss not be aware at the outset that the study was 
primarily concerned with their attitudes. Second, 
for an adequate test of the hypotheses, it was neces- 
sary to choose an issue in which the Ss would indeed 
be personally involved. Finally, it was important 
that the statements to be categorized include a num- 
ber of “neutral” or ambiguous items which would 
be most susceptible to differing interpretations. 

Attitude toward Negroes was chosen as an issue 
appropriate to the above considerations, for reasons 
elaborated in an earlier paper (4). The stimulus 
materials were 114 statements concerning the social 
position of Negroes used by Hinckley (3). The 
statements were originally compiled to represent a 
range from a very pro-Negro to a very anti-Negro 
stand, with a large number representative of the 
middle range where variability of judgment is 
greater. The original items included a fair number 
which were too ambiguous to use in the final ver- 
sions of the Hinckley scale, but were of interest for 
the present purpose. 

The cata to be pressnted were obtained in con. 
junction with the study previously reported (4). It 
was originally planned to have all Ss serve under 
both “imposed category” (standard Thurstone type) 
and “unrestricted category” (Ss free to establish 
their “own categories”) conditions, with a two-week 
interval in between, and with half of the Ss starting 
with “imposed” and half with “unrestricted” cate- 
gory conditions. Because of practical difficulties in 
obtaining two sets of sortings for all Ss in this way, 
the plan had to be modified somewhat. Of the Ss, 
151 served first under the “imposed category” con- 
ditions, while 30 served first under the “unrestricted 
category” conditions. To increase the number of 
cases for the present analysis 61 additional Ss were 
secured who only served under one set of conditions, 
the “unrestricted” category instructions. 
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Care was taken to insure careful and uniform 
administration of the experiment to relatively small 
S groups, so that they could be under close super- 
vision by the Es. The administrators were Negro 
for the Negro groups, and white for the white 
groups. Moreover, they were chosen to appear to 
the Ss to be “one of them.” 

The task was presented throughout as simply 
categorizing opinion statements. No mention of the 
S’s stand on the issue was made until after the cards 
were sorted, 

The Ss were each given an envelope containing 
the 114 statements, randomly shuffled, and instruc- 
tions which pertained only to sorting. After the Ss 
were allowed 3-4 minutes to read the instructions, 
E read aloud: 


You are given a number of statements expressing 
opinions in regard to the social position of Negroes. 
These cards are to be sorted into different piles. 

You will find it easier to sort them if you look over 
a number of cards, chosen at random, before you begin 
to sort. 

You are to sort the cards into the number of piles 
that may seem necessary to you so that the stand ex- 
pressed on the issue of the social position of Negroes 
will be different from the other pile or piles. You may 
sort into any number of piles which in your judgment 
is necessary so that each pile of cards represents a dif- 
ferent stand on the issue. Put statements into the same 
pile which belong together in terms of their relative 
stand on the issue, that is, favorable, unfavorable, etc. 
This should determine how many piles you have when 
you finish sorting. 

This means that when you are through sorting, you 
will have different piles of statements arranged in order 
from low or lowest to high or highest. 

When you are through sorting, please put a rubber 
band around each of the piles. Write the number /] on 
top of the pile of cards which is most unfavorable 
regarding the social position of Negroes. Write Il 
on top of the next pile. The Jast pile you number will 
be that pile of cards which is most favorable regarding 
the social position of Negroes. 

If you complete the sorting before others, please remain 
quietly in your seat until final announcements are made 
by the experimenter. 


The Es answered any questions, but were 
cautioned not to clarify instructions in terms of the 
stimulus material at hand or of the Negro issue. 
During the sorting, which lasted about an hour, E 
inconspicuously observed S$ to see that the task was 
being conscientiously performed. 

When all Ss had completed the sorting, the second 
instruction sheet, which aimed at determining S’s 
attitude on the issue, was distributed. These instruc- 
tions read as follows: 


Now that you have completed the sorting of cards into 
piles, pick up that pile of cards which comes closest to 
your view on the issue (that is, the social position of 
Negroes). On the top card of that pile, please write the 
word “agree.” After writing the word “agree” please 
write one of the following to indicate the degree of 
your agreement with that pile: (1) “very strongly,” 
(2) “strongly,” or (3) “mildly.” 

Now pick up that pile of cards which is most objec- 
tionable from your point of view. On the top card of 
that pile, please write the word “disagree.” After writ- 
ing the word “disagree” on this pile, please write one of 
the following to indicate the degree of your disagreement 
with that pile: (1) “very strongly,” (2) “strongly,” or 
(3) “mildly.” 
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When you have completed the above, please replace 
the piles of cards, each carefully separated by its rubber 
band, in the envelope. 

This is part of a scientific research project. The 
results of each person's sorting will be treated as scien- 
tific data and will be confidential. 


Finally, a Likert attitude scale (5) was adminis 
tered to all white Ss as a check on their attitudes 
toward Negroes. 


Subjects 


The attempt was made to choose persons for Ss 
on the basis of their membership in groups whose 
general stand on the social position of Negroes was 
known. Although this procedure proved satisfactory 
for Negro Ss and one small group of pro-Negro 
white Ss, it was not possible to secure a large group 
with extremely anti-Negro sentiments who were 
comparable in education and other factors to the 
other groups. To get a minimum number of cases 
to represent this stand it was necessary to include 
members of the “average” white groups with the 
most anti-Negro attitude scores. 

Negro Judges. (a) The most extremely pro group 
consisted of 41 mature Negro graduate and summer- 
session students at the University of Oklahoma. 
These students were attending the University in the 
year following the Supreme Court decision ending 
segregation and had an accentuated identification 
with members of their group. They also had more 
actual contact with discriminating practices than the 
younger Negro Ss. 

(6) The next most extreme pro group was a 
younger group of 41 Negro undergraduates at an 
all-Negro university. While segregation and aware- 
ness of anti-Negro feeling had been a part of their 
life, members of this group had grown up largely 
in a Negro environment. The university was also 
located in a Negro community, so this relatively 
“sheltered” condition was continued in academic 
life. 

The total of 82 judges from these two groups is 
used in the main comparisons below. 

White Judges. (a) The most extreme pro-Negro 
Ss among the whites were a small group of 11 
students who were the leaders in a campaign at the 
University of Oklahoma to end segregation. 

(6) The “average” white Ss came from several 
different colleges in the South. Two of the colleges 
were located in southeastern Oklahoma and four 
were in Georgia. The number of years of education 
was approximately the same as that of the Negro Ss. 
A total of 142 judges came from these groups, and 
these are the cases used in the main analyses below. 

(c) Seven Ss were located who were known by 
one of the dormitory counsellors who assisted in 
the study to have strongly anti-Negro sentiments 
as expressed in spontaneous comments on the issue. 
An additional group of seven judges was obtained 
from the “average” white sample on the-basis of 
their anti-Negro attitudes on questionnaires. All 
14 judges had indicated that they agreed with the 
pile of statements which they described as “most 
unfavorable” to Negroes, and had anti-Negro scores 
on the Likert Attitude Scale (scores below 50). 
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REsuLTs AND Discussion 


Number of Categories Used by White and 
Negro Judges 

The data of Table 1 indicate the number 
of categories selected by the groups of 142 
“average” white and 82 Negro Ss as necessary 
to distinguish appropriately different stands 
on the issue. The results indicate a clear 
tendency for the Negro Ss, who are intensely 
involved in an extreme position on the issue, 
to employ a smaller number of categories 
than white judges, who are less strongly in- 
volved. The percentage of Negro Ss using a 
constricted scale (4 or fewer categories) is 
significantly greater than the percentage of 
white judges (p—=<.002 by ,’ analysis). 
Thus the results support the hypothesis that 


TABLE 1 


Numser or Catecorits CHos—EN TO REPRESENT 
Dirrerent Postrions sy WHITE 
anD Necro Jupces 








NuMBER oF CaTecories NecrojJupces Wuits Juposs 
° ° 





4 or less 67.1 
5 or more 32.9 


N 82 
x?=10.3 
p=<.002 


45-1 
54-9 
142 





when individuals are allowed to choose the 
number of categories they consider appro- 
priate to distinguish a series of statements 
ranging from pro to con, the number of cate- 
gories which they adopt is affected by their 
own position on the issue. Intensely involved 
individuals with an extreme position are more 
likely to adopt a constricted scale (with few 
categories). Correspondingly, individuals not 
so intensely involved in an issue are more 
likely to choose an extended scale (with many 
categories). 


Distribution of Items Within Categories 


It is to be expected that a second way in 
which an extreme position and intense in- 
volvement will affect the individual's scaling 
will be in the number of statements assigned 
to the extreme categories, in line with our 
earlier results showing displacement of “neu- 
tral” items. Such an effect was found, shown 
by a piling-up of statements at the anti-Negro 
end whatever the number of categories used. 
Figure 1 shows distributions of items for 
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those white and Negro judges who chose 
three categories, and the corresponding distri- 
butions for those who chose six categories. 
Negro Ss who used three categories and those 
who used six categories both tended to place 
a large number of statements in the extreme 
anti-Negro categories, while the white Ss in 
each case tended to distribute their judgments 
more evenly over all the categories. The 
results are in line with the hypothesis that the 
more extreme the individual’s position and 
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the stronger his personal involvement the 
greater the likelihood that he will be very 
discriminating (“choosy”) in accepting state- 
ments at his own end of the scale. The 
individuals’ threshold of acceptance may be 
described as raised, and his threshold of rejec- 
tion as lowered. In everyday language this 
is the tendency for individuals with intense 
involvement to feel that “those who are not 
for us are against us,” so that even mildly 
critical statements are perceived as hostile. In 
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actual life siutations, some individuals view 
even minor deviations for their cherished 
position as “intolerable,” while other individ- 
uals may find the same degree of deviation 
quite tolerable. 


Prediction of the Judge’s Own Position on the 
Basis of the Number of Categories He 
Employs and the Extent to Which He 


Concentrates Items at the Extremes 


As mentioned earlier, an incidental purpose 
of the present investigation was to explore the 
possibility that the manner in which individ- 
uals formulate their own scales and distribute 
statements into the various categories might 
furnish a useful behavioral index of the indi- 
vidual’s own attitude. A partial test of the 
feasibility of this technique was to devise an 
index summarizing the individual S’s tend- 
ency toward constriction and concentration 
and then determine the extent to which dif- 
ferences were obtained between various sub- 
groups whose general position on the Negro 
issue was known. Five groups were avail- 


able for this analysis. The two Negro groups 
would be expected to have different involve- 
ment in the Negro issue, due to differential 


exposure to various discriminatory practices. 
The three white groups had also been selected 
to represent different stands on the Negro 
issue. One was made up of pro-Negro cam- 
paigners for equal rights for Negroes. A 
second group used was unselected with re- 
spect to attitude and was an “average” group 
of whites with about the same number of 
years of education as the Negro groups. The 
third group was made up of individuals 
whose answers on attitude scales were indica- 
tive of an anti-Negro stand. 

A two-item index was devised to sum- 
marize the degree of constriction of the indi- 
vidual S’s scale and the extent to which it 
was skewed in the anti-Negro direction. The 
cutting point used for constriction-extension 
placed those who used two, three, or four 
categories in the “constricted” category and 
those who used five or over in the “extended” 
group. It was more difficult to secure a basis 
for differentiating Ss in terms of the extent 
to which they tended to concentrate items at 
the extreme anti-Negro end of the scale, since 
degree of concentration would vary with the 
number of categories used. After consider- 
able exploration it was decided to develop an 
index based on a chi-square analysis of the 
extent to which the concentration in the 
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extreme category is significantly greater than 
the value to be expected on the basis of an 
equal assignment to each category. Thus 
when an individual uses only two categories 
an equal assignment would iavolve placing 
57 of the 114 items in the first category and 
57 in the second. Placing 72 of the 114 items 
in the first category represents a significant 
deviation from equal assignment (p= <.01). 
Accordingly, individuals who placed 72 or 
more in the first category were indexed as 
having significant skewing of their distribu- 
tions. Similarly, with three categories, on an 
equal assignment basis one would place 38 in 
the first category and 76 in the other two. 
Placing 52 items in the first category would 
represent a significant deviation from equal 
assignment. The corresponding values for 
the other categories are 41 for four, 35 for five, 
30 for six, 27 for seven, and 24 for eight cate- 
gories. With these values it was then pos- 
sible to state that the individual did or did 
not have a significant concentration of items 
in the first category with equivalent meaning 
for different numbers of categories chosen. 

Each individual was then given a scale 
value based on his tendencies to constrict or 
extend his scale and to concentrate or not 
concentrate the items in the extreme position. 
A scale value of 2 was assigned to the indi- 
vidual who showed both constriction and 
concentration. A value of 7 was assigued if 
the individual concentrated the items but did 
not constrict the scale, and a valuz of a if 
neither constriction nor concentration was 
indicated. The average scale values for each 
of the criterion groups are presented graphi- 
cally in Fig. 2. There is a clear progression 
in average scale value as one goes from anti- 
Negro to intensely pro-Negro groups. The 
average scale value for the latter group is over 
12 times that of the former. The differences 
between the two Negro groups, between the 
pro-Negro and the anti-Negro whites, and 
between the less extreme Negro group and 
the “average” white group, are all significant 
at less than the 1 per cent level. 

The extent of correlation between the be- 
havioral index and the attitude predisposition 
makes further research along similar lines a 
promising field of investigation. The next 
step would probably be a repetition of the 
study using new and independent groups. 
Research with other issues is also indicated, 
preferably with a complete range from pro 
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to con, and with intense personal involvement 
at the extremes. In the present study we were 
unable to get extreme anti-Negro groups 
(Ku Klux Klan, etc.) who would be com- 
parable in educational level with the other 
groups used, 


Correspondence between Results under “Im- 
posed” and “Unrestricted” Category 
Conditions 


It will be recalled that some of the Ss served 
under both “imposed” and “unrestricted” 
category conditions. Under the former the 
subjects were instructed to sort the statements 
into 11 categories in the manner employed by 
Thurstone. It was found that the Ss who 
used a small number of categories to express 
different shades of opinion under the unre- 
stricted conditions tended to be those who 
concentrated their statements at the anti- 
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Negro end of the scale under the “imposed” 
11-category conditions. There was a strong 
tendency for such Ss to neglect or even 
ignore some of the categories in the middle 
segment of the scale, so that in a sense they 
reduced the 11-category scale to a smaller 
number even under the “imposed category” 
conditions. One expression of the relation- 
ship is presented in Table 2, which shows the 
number of categories selected as being neces- 
sary to express the different shades of opinion 
under “unrestricted category” conditions by 
Ss who had varying degrees of concentration 
of their items in the extreme anti-Negro cate- 
gory under “imposed category” conditions. 
There is a significant relationship between the 
two indices of behavior, which is of course a 
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reflection of the common factor of attitudinal 
difference between the groups.” 


SUMMARY 

1. As a test of predictions derived from 
prior work in perception and judgment, 
white and Negro Ss, who represent markedly 
different stands on the issue of the social posi- 
tion of Negroes, were asked to sort opinion 
statements on this issue, using whatever num- 
ber of categories they considered appropriate 
to represent the various shades of opinion on 
the subject. Some of the Ss also carried out 
categorization of the same items under in- 
structions to sort them into eleven categories, 
the procedure usually used with the method 
of equal-appearing intervals. One hundred 
and forty-two white and eighty-two Negro 
Ss were used in the main comparisons. 

2. It was found that the more intensely in- 
volved Ss (the Negroes) were more inclined 
to use a constricted scale with a smaller num- 
ber of categories than were the less involved 
white Ss. The difference in frequency with 
which two, three or four categories were 
used as compared with five or over was sig- 
nificant at less than the .oo2 level (using a 
x* analysis). 

3. There was, as in the portion of the study 
reported earlier (4), a marked tendency for 
the intensely involved (Negro) Ss to place a 
disproportionate number of items at the end 
of the scale removed from their own position. 
In other words, statements which seemed to 
most white Ss to be only mildly critical of 
Negroes were perceived as being very hostile 
by Negro Ss. 

4. The Ss who chose a small number of 
categories under “unrestricted category” con- 
ditions were individuals who were also in- 
clined to concentrate the statements in the 
extreme category when they did the sorting 

2 Of some methodological interest is the question as 
to whether the Ss who carried out the categorization 
without restriction on the number of categories were 
influenced by prior experience with categorization where 
the number of categories was imposed by the E. Two 
groups of judges were available for such a comparison, 
where comparable judges were assigned to groups in 
which the unrestricted categorization was carried out 
first, or to groups in which the “imposed category” con- 
dition preceded the “unrestricted category” conditions. 
It was to be expected that considerable transfer might 
obtain, i.e., that Ss once familiar with the condition in 
which 11 categories were specified by the experimenter 
would use more categories under the “unrestricted cate- 
gory” conditions. No significant effects of this type 
were obtained. The frequency distributions of number 
of categories chosen under the two conditions were not 
significantly different (py 2=.55 for one sample and .75 
for the other). 
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task under conditions where they were in- 
structed to use eleven categories for their 
scaling (py2—=<.001). 

5. An incidental interest in the present 
study was the possibility of using the individ- 
ual’s manner of establishing his own scale and 
assigning items to categories as a behavioral 
index of attitudes. For this analysis five sub- 
groups from the total sample were used to 
represent different positions on the issue. 
These “criterion groups” were, in order from 
most pro to most anti, (a) older Negro 
students in a university which was admitting 
Negroes for the first time on a nonsegregated 
basis, and where intense identification with 
their group obtained, (6) younger Negro 
college students in an all-Negro university 
in a Negro community, with more “shelter- 
ing” from discrimination, (¢) pro-Negro white 
students, members of anti-discrimination, 
equal rights organizations, (d) “average” 
white students, (¢) “anti-Negro” students, 
selected on the basis of attitude scale scores. 

For each of these groups an average scale 
score was calculated, based on the number of 
categories which the individual selected and 
the presence or absence of a tendency for him 
to concentrate a disproportionate number of 
statements at the anti-Negro end of the scale. 
A clear progression of scale scores in line 
with the a priori categorization of the groups 
was obtained. The average score for the 
most pro-Negro group was over 12 times as 
large as that for the anti-Negro group. The 
correlation between the behavioral indices 
and the criterion categories is sufficiently 
great to make this type of indirect test of 
attitude a very promising field for further 
investigation. 
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CASE REPORT 
A CASE STUDY OF FETISHISM 


VERNON W. GRANT 


Hawthornden State Hospital 
Macedonia, Ohio 


HE literature on pathological fetishism 
| is not rich in first-hand accounts by 
affected persons capable of describing 
their experiences in an articulate and insight- 
ful way. The case to be presented is that of 
an individual of considerable intelligence and 
of above-average expressive capacity. Certain 
features of the case are distinctive. The 
subject of the study is fully capable of appar- 
ently altogether normal sex-emotional attrac- 
tion and attachment, and is thus in position 
to compare the affects associated with this 
experience with those of fetishism. Further, 
a possible practical solution of the problem 
for one type of fetishism, perhaps for only 
certain representatives of this type, is offered. 
The subject achieved, in relatively simple 
fashion, a successful method of satisfying his 
needs, altogether free of the ordeals and 
hazards to which victims of these abnormali- 
ties are so often exposed. 

An attempt at comprehensive descriptive 
definition of sexual fetishism would probably 
make a cumbersome statement and would be 
of no particular relevance to this study. 
Within the context of the case history to be 
presented, we may use the term with refer- 
ence to those deviant forms of erotic behavior 
in which, most frequently, a limited portion 
of the body, or an article of clothing, of a 
sexually attractive individual constitutes the 
more or less exclusive focus of interest; this 
interest being manifested in unusual fascina- 
tion with the object or body part and in a 
desire to possess, caress, or fondle it, typically 
accompanied by some degree of genital ex- 
citement. There are admittedly many and 
various instances of the rarer forms of 
fetishistic behavior (e.g., involving certain 
specialized visual, tactual, olfactory, or situ- 
ational experiences) to which so simple a 
formulation would not apply (3, 5). In some 
cases, according to the accounts, the fetish 
object or stimulus may exercise its effect in 
complete dissociation, not only from the 
primary sexual stimuli but from the very 
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existence of the bearer of the fetish. Thus 
there is the hair fetishist who can say, “For 
me the girl herself is nothing; her hair, every- 
thing that I desire.” 

There has been a tendency in studies of the 
fetishist to stress his relative indifference to 
primary sexual stimuli, in sharp contract with 
his intense attraction to features commonly 
rated as incidental. In our subject the only 
deficit in this respect is his indifference to the 
feminine breast. More striking is his capacity 
for complete sex-emotional experience of the 
“romantic” type on the basis of nonfetish 
traits. In this case fetishism appears simply 
and mainly as an exaggerated susceptibility 
to one feature of sexual attractivencss within 
a setting of normal responsiveness to such 
familiar kinds of “charm” as facial beauty, 
voice, grace, and other personal qualities. 
During normal love attachment the subject 
reports a temporary abeyance of fetish inter- 
est. On analysis, this was disclosed to be the 
effect of repression: when “in love” he found 
his fetish interest so far in conflict with the 
exalted emotions awakened by his attachment 
that he was able, with very little effort, to 
banish such inclinations from consciousness. 

Apart from its bearing on the problem of 
the place of this form of erotic abnormality 
within the general theoretical framework of 
sexual psychology, the case offers some evi- 
dence of the need of effort directed toward 
public enlightenment concerning the nature 
of these disotders. There are doubtless many 
who still view them as perversities rather 
than as maladies, as delinquency rather than 
as disease. 

Contact with the subject, who is not and 
has never been a psychiatric hospital resident, 
was made independently of the writer’s pro- 
fessional functions. 


Tue Susyecr 


The subject is a 35-year-old male, American born, 
of American-born parents. He is white and unmar- 
ried. He has engaged in several different relatively 
unskilled occupations. He is largely self-educated, 
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in good physical condition, and without history of 
serious somatic ailments. He was enuretic until 
about 12 years of age. He suffered chronically from 
nightmares, “all my life, as far back as I can 
remember. For years it seemed like almost every 
night.” These occurred, typically, immediately after 
falling asleep. He reports also much talking in 
sleep. A paternal aunt is psychotic; a younger 
brother is described as rather neurotic, subject to 
depressions in early life. His mother is of life-long 
“nervous and anxious” disposition. He first became 
aware of his peculiarity as a recognized sexual devi- 
ation through reading Krafft-Ebing; he was amazed 
and deeply disturbed, felt he was a misfit. He 
finally came to accept the abnormality—intellec- 
tually, at least—without guilt feelings, realizing he 
was in no way responsible. His peculiar sex needs 
have involved him, through the years, in a variety 
of experiences, some rather stressful in character, 
owing to the risks to which he was forced in pur- 
suit of his objectives. His account follows. 


Tue Susyect’s Account 


At the age of 35 I can look back over many 
years of experience with sex fetishism. I can 
remember an interest in women’s shoes as 
early as my fourth year, and that this interest 
continued during my grade-school days. I 
remember fantasies of women wearing high- 
heeled slippers of dainty design. There were 
also secret periods of play with my sister’s 
and mother’s shoes in the clothes closets. 
Shoes had no sexual meaning at this time. 
I don’t know why I felt that secrecy was 
necessary. If I was ever observed I was never 
punished. I recall clearly that when I was 
four years old, just before Christmas, an aunt 
visited. I know it was my fourth birthday 
because I received a set of toy soldiers, and 
because it was before I started kindergarten. 
She wore high-heeled shoes. I remember 
watching, and I know that my attention was 
centered on her shoes, that I was specifically 
interested in her shoes to the complete exclu- 
sion of everytyhing else about her. 

I masturbated at age nine, and I am posi- 
tive that I thought of women’s shoes during 
the act. I remember too when I first dis- 
covered the type of shoes, in an illustration 
in a book, that excited me most. At age ten 
or thereabouts I recall staying with my aunt. 
I was allowed to sleep with her because I was 
troubled with nightmares. I once went 
through her dresser drawers looking for 
hidden Christmas presents when I found a 
pair of high-heeled shoes. I got very excited, 
as if I had found a treasure. That night 


when she came home I persuaded her to put 
them on by saying I thought they must be 
too small for her. She did so and then I tried 
to get her to take a few steps, but she refused, 
saying it was silly. I was very excited during 
all this, though I do not recall that it was 
sexual. However, I do remember having an 
erection when I once saw her putting her 
hose on one morning. Also that one day 
when I asked about the shoes she said she 
had given them away and that she did not 
want to discuss it. I was given a bed of my 
own and never permitted to sleep with her 
again. I am sure she felt there was some- 
thing strange about my interest in the shoes. 

When I was 12 years old I was attracted 
to my sister’s shoes. I used to wait for her 
to sit-at the table. I had a strong urge to 
get under the table. I would pretend to drop 
something to get under it. I was interested 
only in her shoes. I’m sure I had no other 
interest, as her legs were thin and uninterest- 
ing. I used to dream about this too. In the 
dreams she would put on pumps and then 
touch my genitals with them. Even in later 
years I had dreams in which I followed her 
to the bedroom and. asked her to put on the 
patent leather pumps. I was never conscious, 
in the dream, of arousing her suspicions of a 
sex motive in asking her to put the shoes on, 
but it always happened that when she did so 
she would extend her foot forward to press 
against my genitals and then the dream 
would end. 

There were other examples during my 
teens when I became excited over a girl’s 
shoes, especially when she had shapely legs. 
In my seventeenth year I met a girl who 
attracted me strongly at once. Although I 
was conscious of her figure, neither her shoes 
nor her legs aroused more than a slight 
interest. My passion was essentially mental 
and free from any conscious sex disturbance. 
Demonstrations of affection were limited to 
good-night kisses infinitely tender. I loved 
her with intense emotion entirely devoid of 
any lustful thoughts or desires. The romance 
ended when she went away to college, and I 
suffered painfully intense heartbreak over it. 

Soon after this I secured employment and 
managed to recover from my depression by 
heavy physical labor. For several years I had 
little interest in women and rarely mastur- 
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bated. I put all my energies into work. 
Sometime around my twenty-fifth year my 
interest in shoes regained its strength. I 
began to frequent the waiting rooms of rail- 
road stations. Sitting opposite a woman I 
would concentrate on her shoes and legs. 
In the earlier stages of this practice I would 
masturbate through my pocket, concealing 
this by holding a newspaper over my crossed 
legs. At a later stage I obtained satisfaction 
without mechanical means, the effect of look- 
ing at the shoes and .cys being enough in 
itself to produce orgasm. What excited me 
most was patent leather pumps with French 
heels, of simple design and free of decora- 
tions like straps and bows. At times the 
nervous tension became so great as to cause 
visible trembling in a kind of ecstasy. Shoes 
alone were not enough, however. In addi- 
tion I needed shapely ankles and sheer hose, 
preferably the darker shades. The effect was 
much increased by movements of the foot. 
No part of the leg above the knee held any 
interest for me. 

I have never experienced the intense nerv- 
ous excitation, in anticipating normal sexual 
intercourse, comparable with the effect of an 
opportunity to see legs I consider especially 
beautiful, and provided circumstances are 
such that I am free of feelings of anxiety. 
I have noticed that, in watching a woman’s 
legs in a public place, if there is a chance I 
may lose sight of her I become overeager 
and anxious, and this sometimes makes me 
impotent. If I can manage to relax and con- 
centrate on the sheer enjoyment without 
thinking of an interruption, I then experience 
rising excitement and climax much more 
quickly. At the period when my fetish pas- 
sion is most active, the effect of shapely legs, 
even when glimpsed for only a moment, is 
enough to set me on fire. The effect is not 
localized in my genitals, but seems spread 
over my entire body. There is a heat sen- 
sation, as if my blood has been warmed, 

I have suffered a great deal of anxiety dur- 
ing my visits to public places where it is easy 
to watch women’s shoes and legs. In addi- 

. tion to the feeling of guilt which I still feel 
at times, there is the danger of detection. 
Plainclothesmen have haunted me, and more 
than once I have been questioned on sus- 
picion of loitering. I have tried to be casual 


in my behavior at waiting rooms, but the 
length of time I spent there, the constant 
moving about for a vantage point, and my 
fixed scrutiny when I found one would 
attract attention and I would be warned to 
“shove off or else.” It was not easy to find 
what I was looking for, since in addition 
to the proper type of shoe and the well- 
shaped ankles and legs, I needed a certain 
amount of movement, for example the slight 
swinging of the leg when one rests across the 
other. If the foot is motionless there is no 
increase in stimulation, no “build up.” In 
order to get this there must be movement. 
The more gyrations of the foot and leg, the 
more the stimulation. If I know a woman 
is conscious of my watching her legs it 
increases the excitement. Another thing is 
that similar shoes and legs have different 
effects if in ene case the woman is distin- 
guished-looking and refined, and in another 
case she is coarse-looking and “vulgar.” 

My sexual condition is important, too. 
After several weeks without satisfaction the 
urge reaches a strength where just prolonged 
staring at the legs and shoes of a mannequin 
in a store window will be enough to give me 
an orgasm. This has happened a dozen 
times, at least. Once I tried to buy a man- 
nequin. However, I have to be in a highly 
excitable condition for this kind of satisfac- 
tion. If I buy the kind of shoes I prefer and 
ask a woman I know to wear them for me, 
it doesn’t have the same appeal as if they 
were her own shoes. I guess this is because 
they don’t seem to be as much a part of her. 

Certain types of hands excite me too, but 
only if I am already aroused. Once I had an 
orgasm watching a woman’s hand, but I had 
zlready been strongly aroused. Otherwise, 
I am completely indifferent. It is harder to 
find the right type of hand than the proper 
kind of shoe and leg. It must be long and 
slender, and preferably with painted nails, 
Sometimes the hand arouses excitement, but 
there is no build-up. It’s the same way with 
a woman’s buttocks. Four or five times I've 
reached orgasm through watching a woman’s 
buttock movements in walking, but only 
when I’m already excited. I can’t start cold 
that way. As to prostitutes, I always take 
the proper shoes and hose with me when I 
go to them. I'd say that the sight of the 
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genitals is normally stimulating to me. In 
having intercourse I sometimes have the 
woman take a position so I can see her geni- 
tals during the act. 

My techniques in pursuit of my fetish 
were a product of experience and involved a 
certain amount of skill and manoeuvering. 
For example, I spent much time in public 
libraries because there was less danger of 
being watched. By walking past a woman 
who was seated with a book and stopping 
for a moment directly in front of her to stare 
at her foot, I would gain her attention and 
she would watch me until I had taken a seat. 
Then I would continue to look, but in a 
covert manner, as though conscious that I 
was acting at variance with popular ethics. 
Usually the woman would conceal as much 
of her ankles as possible, sometimes even 
taking her shoes off and sitting on her feet. 
When this occurred I would smiie in a smug, 
amused way to give her the impression she 
was being laughed at, with the result that 
she would often resume her position and per- 
haps even expose more of her legs to indi- 
cate her contempt for my opinion. I would 
religiously avoid catching the woman's eye, 
concentrating entirely on her shoe and ankle 
to convince her that my interest was imper- 
sonel and confined to the shoe. If she moved 
her foot experimentally I would exaggerate 
my pleasure by facial animation, sometimes 
parting my lips to convey passionate interest. 
My whole bearing would suggest intensity 
of sensation. Sometimes this defeated my 
purpose, the woman becoming alarmed, but 
these instances were rare. It was also rare 
for a woman to change her seat or leave the 
premises as a result of my interest. Usually 
my model would continue to move her foot 
experimentally, sometimes even getting up to 
walk back and forth along the bookshelves 
near my seat and taking poses to reveal her 
underpinning to the greatest advantage. She 
would sometimes walk in a teasing manner, 
rotating on the balls of her toes with each 
step. I learned to catalogue various types: 
the essentially vain, the purely experimental, 
the naive who only partially understood and 
were indignant, and the stupid, who sat on 
their feet and were very uncomfortable as a 
result; also the frustrated, who opened a con- 
versation for the purpose of arranging a date 
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and subsequently a bit of sex for recreation. 
On several occasions a woman cither ap- 
proached me directly to make my acquaint- 
ance, or dropped a note on the table as she 
strolled by. I seldom responded to these 
notes. It was risky. On one occasion I fol- 
lowed such a request and the party turned 
out to be a policewoman who would have 
run me in had I not bought her a pair of 


nylons. 


In a few instances in which contacts made in 
public places led further, the subject reports he 
required the shoe to be worn during coitus and 
describes the fetishistic sex experience with em- 
phasis on positions permitting the shoes and legs 
to be visible during intercourse. It is noted that 
he persuaded one partner to stand upon his bare 
stomach, and in one other case refers to a “masoch- 
istic thrill.” He repeatedly stresses the importance 
of movement, and states that even well-shaped legs 
and the preferred type of shoes, without movement, 
arouse him very slowly. Foot pressure was also a 
stimulus, and he tells of excitement aroused by the 
accidental pressure of a woman's foot, as when 
standing in a crowd, if exerted with sufficient force. 
On one occasion he became sexually stimulated by 
the sound of the heel taps of a woman walking by, 
and not at the moment visible to him. 


During a period of unemployment I be- 
came completely devoted to pursuit of the 
fetish. For over a year I did not have an 
adequate income, frequently sleeping in parks 
and boxcars and often not getting enough to 
eat. At this time my fetish became like a 
disease. I was a sick man and I knew it, but 
any resolutions I made to defeat the impulse 
vanished once I sighted the fetish. The 
fascination made me simply helpless. I 
finally reached the stage where nothing else 
mattered. I stopped caring where I slept 
and went for days with little food. It seemed 
that the urge became more difficult to satisfy 
as my vitality waned. For three or four days 
at a time I would experience interrupted 
orgasms without ejaculations, walking about 
town in constant search. I visited bus sta- 
tions, hotel lobbies, hospital waiting rooms, 
and even churches—any place where a 
woman wearing my fetish needs might be 
found seated long enough to make an ejacu- 
lation possible. I could not reach orgasm by 
mechanical means now. Attempts to mastur- 
bate curbed the sensation. The orgasm had 
to be spontaneous. Desire to urinate helped. 
There were times when I was ready to burst 
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from restraint. Under prolonged stress I 
suffered not only from the maddening frus- 
tration but also from excruciating headaches 
which affected my vision and paralyzed my 
neck muscles. When finally I succeeded in 
breaking this extreme nervous tension the 
orgasm would be so feeble as to lower my 
spirits, robbing me of any satisfaction. After 
experiences of this kind I would make new 
efforts to rehabilitate myself. 

Often I have boarded trains in pursuit of 
a woman wearing my fetish. Sometimes the 
starting whistle of the engine would induce 
ejaculation. But if I had not been able to see 
the legs or the shoes of the woman after she 
boarded the train because of luggage cutting 
off the view, or failure to get a seat opposite 
her, I would return to the platform and look 
through the window. This was possible only 
when the tracks were lower than the plat- 
form level. Sometimes my excitement would 
be so intense as to cause me to run beside the 
moving car. I have had ejaculations while 
running at almost top speed. On one occa- 
sion my absorption was so great that I fell 
off the end of the platform and was knocked 
unconscious. My peculiar behavior in rail- 
road stations was sometimes checked by the 
police, but after questioning I was released. 
I never gave my true motive for chasing 
trains. Twice I was jailed under charge of 
disorderly conduct, being given 15 days of 
imprisonment altogether. 

My fetishism has much influence on my 
tendency to develop interest in a girl. The 
discovery that a girl I have just met has well- 
shaped legs has a strongly quickening effect 
on my interest in her as an individual. A 
girl might appeal to me very much in features 
and personality, but if I discovered her legs 
were ugly I am certain I would lose interest. 
On the other hand I am equally certain that 
once I have fallen in love, apart from any 
effect of the legs, the discovery that the latter 
were unattractive would alter my feeling very 
little. 

Twice I have been very much in love. On 
the second occasion it was with a much 
younger girl. She was very fond of me and 
permitted very ardent kissing. I can truth- 
fully say that I was completely happy just 
confining my affection to kisses and embraces. 
I never touched her breasts or any part of her 
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body that was sexually vulnerable. Even 
though I got an erection at these times I had 
no real desire to go further. I was content 
just to kiss her again and again and to hold 
her close. The thrill was not progressive. 
My passion did not increase by the long 
embraces. I simpiy felt immeasurable bliss 
and happiness while holding her. When 
separated from her I was miserable. I never 
thought of her as a bed companion. Despite 
her unusually shapely legs I had an actual 
slight aversion toward looking at her shoes. 
In explaining this I would say that because 
my love for her was essentially spiritual I felt 
it would be a kind of sacrilege to look at her 
shoes or legs. I suppose this was a throw- 


back to my old prudish inhibitions. 


The subject’s capacity for apparently quite normal 
“amorous” love attachment makes possible a com- 
parison of this with fetishistic attraction. He states 
that the effect upon his sensibilities of a woman 
whose shoes, legs, and person meet the requirements 
of his taste includes much more than genital sex 
excitement. The “charm” of gracefully designed 
shoes, well-formed ankles and legs imparts itself to 
the entire person. The emotion he then experi- 
ences, he states, is very similar to that of being “in 
love,” or at any rate to the early stages of this 
emotion. He stresses, however, that the emotion 
itself tends to focus upon the initially alluring mem- 
bers, and he tells us that he has experienced a 
stronger desire to kiss and to caress the attractive 
woman’s ankles, legs, or shoes, than to kiss her lips, 
“unless she is exceptionally beautiful.” Without her 
shoes, such a person immediately loses much of her 
charm; he reports such loss of interest as a striking 
experience in several instances. He refers to an 
attraction in which he felt “quite normally in love” 
with a girl whose shapely legs attracted him 
strongly, yet he feels “quite certain” that, but for 
the fetish, he would have been indifferent, and that 
the amorous state would not have developed. 

It seems significant that the effect of the fetish 
here occasionally exhibits a definite parallel with 
the traditional “love affair.” Contacts with the 
fetish object are described as having a quality of 
“sweetness, of loving caress” which he recognizes 
as different from that of genital sex arousal. This 
recalls Binet’s observation that “for fetishists, the 
sense perception of the loved object is a source of 
pleasure superior even to sexual sensation” (1). 
Apart from providing a stimulus to sexual arousal 
and gratification the fetish object is sought, he 
believed, as an end in itself. There are “two orders 
of sentiment” in these experiences, and one of them 
is a kind of adoration; it is aesthetic, “purely cere- 
bral.” The writer has elsewhere cited illustrations 
of this distinction (4). 

In the course of five arrests the subject had con- 
tacts with police authorities which registered rather 
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vividly upon his memory. Some excerpts from his 
account of these experiences follow. 


On one occasion I was sitting in the shoe 
department of a large store. A house detec- 
tive spotted me and pointed me out to a city 
policeman who watched me for a while, then 
recognized the signs of my interest in legs. 
(I learned all this later, of course.) He ap- 
proached and asked me what I was doing 
there. I said I was waiting for my wife. 
He said, “The hell you are, you God-damned 
pervert.” He ordered me to the office of the 
store, pushing me ahead of him, using vile 
language like “degenerate bastard, son of a 
bitch,” etc. The manager said he would 
press no charges in the matter so long as I 
kept out of his store. Nevertheless, the off- 
cer took me to the precinct station in the 
patrol wagon. At the station the arresting 
officer called the rest of the policemen around 
and told them I was a degenerate. They 
jeered me and made some nasty jokes. One 
of them offered me a cigarette, then asked 
me if it tasted like a penis. The police chief 
came in and asked my name. Then he took 
me into his office, gave me a seat, offered me 
a cigar. He was friendly, said he wanted my 
story, that he wanted to help me, not crucify 
me. Later he said I'd be held for three days, 
then would go to a psychiatrist, then to court. 
On the third day he sent me to the magis- 
trate, along with the probation officer. We 
went to his private office. There was a news- 
paper man there who wanted a story on the 
case. The officer refused and said he would 
never divulge the information I gave him. 
The probation officer was pretty decent. 
Then I was sent from the county jail to 
another precinct where I was held for psy- 
chiatric examination. After several days I 
saw the chief psychiatrist. After a short 
interview he decided I was not a public 
menace. He was very skeptical of my sin- 
cerity when I said I wanted to be cured. I 
was released and told I must appear before 
the magistrate I had seen previously. When 
I appeared for final judgment the magistrate 
reviewed the psyc’iiatric report and sent me 
to a mental hygiene clinic. There a young 
psychiatrist gave me about 20 minutes. He 
was very decent to me, and took some notes. 
He advised me to be as careful as possible, 
but said that I was incurable. On a second 
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visit I was interviewed by another psychia- 
trist, also sympathetic. 

At another time I was following a couple 
of girls down a city street, hoping for a good 
view of their shoes and legs. They noticed 
it, became alarmed and spoke to a policeman. 
He called to me and blew his whistle. I got 
frightened and ran. He followed, caught me 
and arrested me. I could easily have gotten 
away but was paralyzed with fright. He 
collared me and pushed me ahead of him. 
At the station I was finger-printed and put 
in the municipal jail. Later this officer came 
to my cell with a man called the public 
defender. At first he appeared to be very 
friendly, and asked me questions like “You 
like to masturbate, don’t you?” “You like 
to lay with men?” “You like to dream about 
women, don’t you?” Then he said, “You 
have dementia praecox.” He mispronounced 
the words so badly that if I had not known 
what he was trying to say I would not have 
understood him. He suggested various sex 
practices regarded as degenerate, then be- 
came incensed with me when I denied any 
such impulses. Finally he became angry 
with me and left. The next morning I went 
before the judge. I gave him a letter I had 
written the night before, explaining truth- 
fully what my motive had been in following 
the girls and giving him some of the facts 
about my abnormality. After reading it and 
weighing the evidence he was inclined to 
let me go. The arresting officer, who had 
earlier treated me with great contempt and 
indignation—told me I wasn’t human—pro- 
tested against letting me go. The assistant 
district attorney advised the judge that he 
thought I should be held for ten days’ 
observation. 

A flash picture was taken of me in court 
and published in the paper the next day. It 
finished me in that city. I lost my friends 
there. I served my ten days and was dis- 
charged. I received no psychiatric interview 
of any kind. 


A practical solution of the problem, through the 
discovery that living models were not essential to 
gratification of the fetish urge, was discovered by 
accident. 


A few years ago I saw a motion picture 
called “The Golden Calf,” based on a plot 
to find the most beautiful legs in Hollywood. 
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There were about 20 minutes of film devoted 
entirely to leg exhibitions. During these 
scenes I experienced two ejaculations the first 
time it was shown, and I followed it out to 
small neighborhood theatres with gratifica- 
tion each time. This inspired me to get my 
own camera and projection machine, and to 
develop my own films. I have taken many 
hundreds of feet of film and always enjoy 
complete satisfaction regardless of how many 
times I look at the same pictures. I have 
employed girls in brothels for this purpose, 
but the best pictures I have taken were non- 
professional girls who were under the im- 
pression that I was interested in photography 
solely for sales promotion of hosiery. I 
advertised for hosiery models in a daily paper. 
I had cards printed representing me as a 
publicity agent for an American hosiery com- 
pany and had forms made up for the appli- 
cants to fill out. I rented a large hotel room 
with a reception annex for the purpose of 
interviewing, and while the applicants filled 
out the forms I revelled in watching their 
legs. When a model met with my complete 
approval I took motion pictures of her walk- 
ing and seated which displayed her legs to 
best advantage. For this purpose I used the 
shade and texture of hose which were most 
alluring, along with patent leather opera 
pumps. I kept the three most common sizes 
of shoes on hand at all times. I prefer a long 
slender-shaped foot and long legs. The 
majority of my models were 5-8 or 5-9 in 
height. I arranged lamps before mirrors for 
lighting purposes when taking pictures and 
got satisfactory results. The models were 
never suspicious of unnatural interest in their 
legs, even when I boldly used a tape measure 
to get exact measurements of calf in relation 
to knee and ankle. During these procedures 
I frequently had orgasms and was always 
at a high tension of anticipation. 

By instructing them to sit, walk and gyrate 
their ankles and shoes in a manner especially 
exciting to me I have obtained the maximum 
enjoyment possible by no other method. 
The absence of risk and the sense of com- 
plete security is important. It is necessary to 
run the film no more than 15 minutes at the 
most to be completely satisfied, though I 
sometimes practice interruption for the sake 
of prolonging the pleasure. I never mastur- 
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bate while watching these pictures. The 
build-up is rapid and the climax more intense 
than it would be in actual association with 
living models. I have used these pictures 
without ever losing interest through famil- 
iarity. They continue to excite me with 
maximum pleasure. In no single instance are 
the pictures suggestive or lascivious; they 
are confined to leg displays not higher than 
the garter. 

I have had no intercourse in over three 
years and rely entirely on my camera as an 
outlet for sexual desires, which have dimin- 
ished from about twice a week to three or 
four times a month. Occasionally I see a 
woman on the street who fires my imagina- 
tion as a bed companion, but the urge is 
never very strong. I have experienced pleas- 
ure in normal intercourse comparable with 
that of the shoe fetishism, but this was only 
when I was infatuated or exceptionally fond 
of the girl. However, I have had only one 
experience with a girl when the shoe was 
not needed to make intercourse satisfactory. 
On a few occasions I have asked prostitutes 
about their experiences with fetishists. While 
I learned of three or four cases that resembled 
my own, none of them knew of a case of the 
use of motion pictures for this purpose. 


CoMMENT 


Concerning the intensity of the urge: 
“Nothing else in my life can compare with 
it—it’s so absorbing.” When free of attach- 
ments, and when resources and circumstances 
made possible the regular satisfaction of his 
needs without risk or anxiety, he acknowl- 
edges that he would not wish to be rid of his 
peculiar desire because of the intensity of 
the fascination, excitement, and profound 
satisfaction it offers. 

Among the founders of the scientific study 
of fetishism was Alfred Binet(1). He 
stressed the significance of forms representing 
an intermediate position between highly 
specialized and exclusive susceptibilities, com- 
monly regarded as unquestionably patho- 
logical, and those thoroughly familiar pref- 
erences which comprise the phenomenon 
of normal sexual choice. The latter he 
regarded as “minor” fetishisms. In some 
degree everyone, he suggested, is a fetishist 
in sexual interest in so far as he finds himself 
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from time to time especially attracted by this 
or that feature or trait. While the near- 
normal varieties of fetishism do not lead to 
such extravagances as the theft of hair-locks, 
or the compulsive effect of a voice of dis- 
tinctive timbre, or of a peculiarly intriguing 
grace of movement in walking, they may 
yet be the basis, Binet suggests, of certain 
liaisons or marriages which astonish everyone 
by the absence, in one member of the pair, 
of any generally recognized or commonly 
valued traits of attraction. Similar observa- 
tions on intermediate forms have been re- 
corded by Kahn (6, p. 152) and by Ellis 
(2, p. 111). 

The commonest of “erotic symbolisms,” 
according to Ellis, are those which involve the 
foot and shoe; he suggests that the frequency 
of foot fetishism may seem more comprehen- 
sible when it is considered that in some parts 
of the world, as among the southern Chinese, 
the foot is “generally recognized as a form of 
sexual attraction” (2, p. 21). He cites evi- 
dence of emphasis on the sexual value of the 
foot in oriental pornography and other data 
from which a parallel might be drawn be- 
tween the sexual value of the foot, in 
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Chinese culture, and that of the breast, for 
example, in our own. The writer recently 
noted an American magazine advertisement 
of a 16-mm. motion-picture featuring nu- 
merous close-ups of walking and dancing 
movements by girls wearing “stilt high” 
shoes of patent leather, with a six-inch heel. 
While other items of appeal were specified, 
the accent fell beyond question upon the 
movements of the feet and the character of 
the shoes. If demand for material of this 
kind is such that its production is com- 
mercially profitable, the possibility is sug- 
gested that interest in this direction, of var- 
ious degrees, may be more than a rarity. 
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CRITIQUES AND NOTES 


SOME UNFORESEEN CORRELATES BETWEEN THE STUDIES OF 
SHAW AND WALLEN 
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reviousLy published in this journal were two 
Seana articles by Shaw (1) and Wallen 
(2).. The Shaw article presented frequencies of sex 
responses on the Rorschach cards when such re- 
sponses were solicited by the examiner. Shaw's in- 
structions to his subjects (Ss) were to produce as 
many sex responses on the cards as possible. From 
his frequencies it is possible to rank each card in 
relation to the others according to how sexually 
suggestive it appeared to his Ss. 

The investigation published by Wallen indicated 
which cards his Ss preferred or disliked. He dealt 
with a group of 65 normal service men and a group 
of 30 unstable service men who were diagnosed as 
psychoneurotic. Again the data were presented, as 
in the case of Shaw’s article, in such a manner that 
it is possible to rank the cards in order of prefer- 
ence. 

The immediately apparent question relates to a 
possible connection underlying the information pre- 
sented in the separate studies mentioned. Is it 
possible that a significant relationship exists be- 
tween the preferred cards and the extent to which 
these cards are sexually suggestive? On the basis 
of the theory of repression one might expect that 
the Rorschach cards presenting to the S§ a large 
number of sexual stimuli would arouse within him 
a number of unconscious inhibitions leading him 


to a rejection of such cards when asked to make 
an overt, conscious preference. It is conceivable 
that there might be present here a working exam- 
ple of the mechanism of reaction formation. 

The standard formula for rank-difference cor- 
relation when applied to these data results in a rho 
of 651 (p=<.05), indicating a negative relation- 
ship in that the more sexually suggestive a Ror- 
schach card, the less frequently it is preferred. 
This relationship has been obtained from Wallen’s 
normal group when compared with the data from 
Shaw’s investigation. 

Theoretically one might expect such a relation- 
ship to be higher for an unstable, psychoneurotic 
group than for a normal group, since repression is 
supposedly more intense and more frequently sex- 
ually related in the neurotic population. However, 
though the relationship is higher (rho=.700, p= 
<.03), it is not significantly higher. Verification 
for this can be found in the relationship between 
Wallen’s normal and his unstable groups. In this 
case rho equals .769 with a p of less than .o2. 
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THE NATURE OF SYMPTOMS IN THE MINOR BEHAVIOR DISORDERS ' 


F. L. MARCUSE 
The State College of Washington 


YMPTOM treatment is frequently used as a term 
SS of opprobrium, for it is said, where there is 
smoke, there is fire. However, Wolberg (7) has 
pointed out situations which indicate the need for 
a therapy which approaches or is at the symptom 
level, and Maslow (3) has implied the same point 
when he suggests distinguishing between coping 
and expressive behavior. The present paper de- 
velops the argument and makes further inroads into 
the traditional concept of the symptom by suggest- 
ing that in certain cases symptoms may be of an 
autonomous nature and should therefore be at- 
tacked directly. An autonomous symptom may be 
defined as one which is not deeply imbedded in 
the psychic structure of the individual and one 
which does not reflect an underlying conflict. Cri- 
teria for its presence are: absence of maladjustment 
as shown by interviewing and testing, and thera- 
peutic results which do not involve relapse, symp- 
tom substitution, or manifestations of extreme anx- 
iety. These latter characteristics would be expected 
if a true * symptom had been incorrectly identified 
as an autonomous one. 


Evidence supporting this distinction in symptoms 
is also found in the work of Mowrer (4) and Wells 
(6), who were concerned with enuresis and chronic 


headache respectively. By means of a frontal at- 
tack on the symptoms, both report obtaining suc- 
cessful therapeutic results. Follow-up studies years 
later did not indicate the presence of relapse, anxiety 
or symptom replacement. In the field of medi- 
cine proper, suggestive evidence for the presence of 
autonomous symptoms is to be found in the work 
of Thom (5), Heyer (1), Schultze (1), and Schind- 
ler (1). These investigators report obtaining suc- 
cessful results by the direct use of hypnosis where 
the pain, symptom, or disturbance persisted as a 
habit after the somatic basis had been eliminated. 
Schindler, in discussing the problem of incoercible 
vomiting, is paraphrased as stating that it is always 
instructive to see cases of habitual vomiting com- 
pletely and permanently cured by a few hypnotic 
sessions, and again to see many others completely 
refractory to such treatment. To understand both 
therapeutic successes and failures it is important, he 
states, to find out in what layer of the psychic 
structure the symptom has its roots, i.e. whether 
it is true or autonomous. 


1 Read before the Socicty of Clinical and Experimental 
Hypnosis, New York, 1950. 

2 The term ¢rue refers here to the traditional usage 
which regards all symptoms as signs of something else. 
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The following synopsis of a case study further 
supports the thesis that when the symptom is of 
an autonomous nature it may be attacked directly. 


A student came to see the writer with the complaint 
of chronic nail-biting of some 10 years’ duration and of 
sufficient severity to leave his finger ends red, raw, and 
frequently bloody. The reason for seeking help at this 
time was due to a rumor that anyone with such a habit 
would be sent to POE (Port of Embarkation). During 
the course of four subsequent interviews his case history 
was taken and the Rorschach and the TAT were admin- 
istered. The client appeared well adjusted, was doing 
adequately at college, got along well with members of 
both sexes, and had no interpersonal problems. These facts 
were substantiated from outside sources. The Rorschach 
and TAT revealed no psychological pathology. Nail- 
biting had started about the time that the client’s parents 
had separated, at which time: he remembered friction in 
the home. The mother was given custody of the child 
and the father permitted to see the child every so often. 
He adjusted fairly rapidly to the situation, probably due 
to the fact that his relationship with his mother had 
always been good. The nail-biting seemed to reflect 
neither a current conflict nor a past unresolved problem. 
Under the circumstances the writer decided on a direct 
attack on the symptom using hypnosis. The client was 
an excellent somnambule, going into a hypnotic state 
quickly and deeply at the first session (a 20-plus on the 
Friedlander-Sarbin Scale (2) for hypnotic depth). In 
the course of the next eight hypnotic sessions, over a 
four-week period, he was told: (a) that he would not 
feel the urge or necessity to bite his finger nails, and 
(4) that should he ever feel even the slightest urge to 
do so he would chew gum instead and find that this 
satisfied him. Three months after termination of the 
hypnotic sessions the client's fingernails were in good 
condition and had grown. No other change in behavior 
was observed or reported. In response to casuz{ conver- 
sation, it was found that he now chewed about two 
packages of gum weekly while prior to the hypnotic 
sessions he had indicated that he “bought a pack every 
now and then.” There is some question as to the sig- 
nificance of this last finding. To be strictly analogous 
to the cases already cited, the first suggestion given the 
client (not biting his fingernails) should have sufficed— 
and might have. The writer hedged by allowing for 
the possibility of symptom substitution, although it may 
be questioned whether providing a qualitatively different 
and socially acceptable outlet represents a symptom sub- 
stitution, or whether, as indicated above, the second 
suggestion (chewing gum) was at all necessary. A year 
later when the client revisited the writer, his fingernail- 
biting was, he said, a thing of the past and the condition 
of his fingernails substantiated his statement. Again no 
other significant change in behavior was noted or re- 
ported. The extent of gum chewing was approximately 
the same as the previous year. 


Maslow in approaching this problem of symptom 
differentiation from a theoretical point of view 
urges that one distinguish between coping behavior 
(motivated in the traditional sense of need gratifi- 





152 


cation) and expressive behavior (unmotivated in 
the traditional sense of need gratification). He 
cogently states the implications of such a distinction 
for symptum therapy as follows: 


There is a simple enough test, simple at least in theory, 
for making the differentiation between symptoms which 
are truly neurotic, i.c., functional, purposive, or coping, 
and symptoms which are primarily expressive. If a neu- 
frotic symptom does have a function, does do a job for the 
person, then we must assume that the person is better 
off for having this symptom. If it were possible to rob 
the patient of a truly neurotic symptom, then, according 
to theory, he should be hurt in one way or another, i.c., 
thrown into anxiety or acutely disturbed in some other 
way. . 
if, on the other hand, the symptom is not truly func- 
tional, if it does not play some vital role, then ripping 
it away will do no harm, but will only benefit the 
patient. One of the usual strictures against symptom 
therapies is based on just this point, i.¢., assuming that 
the symptom which seems to the onlooker to be wholly 
useless actually plays an important role in the psychic 
economy of the patient, it therefore ought not to be 
tampered with until the therapist knows exactly what 
this role may be. 

What is implied here is that while symptom therapy 
is admittedly dangerous for truly neurotic symptoms, it 
is not at all dangerous for symptoms which are merely 
expressive. These latter may be ripped away with no 
consequence but benefit to the patient (3, p. 268). 


Where Maslow has termed behavioral symptoms 
as either coping or expressive, the present writer 
has called them true or autonomous. In addition, 
this paper suggests an empirical test for differentiat- 
ing between the two kinds ® of symptoms. A symp- 
tom then, whether it be stuttering, nail-biting, enure- 
sis, or a headache, may be autonomous or true. Etio- 
logically the autonomous or expressive symptom un- 
doubtedly is a residue of past difficulties, but its 
present operational definition suggests the use of a 
specific type of therapy, i.e., direct. The important 
point here is that such residual behavior may per- 
sist without apparently being motivated in the usual 
sense of the word. The writer is in agreement with 
Maslow about the danger of confusing autonomous 
and true symptoms and goes further by indicating 
what may happen should such a confusion of 
symptoms occur. 

Three criticisms may be leveled against the ra- 


8 The present paper is concerned with distinguishing 
the two types of symptoms in different persons and does 
not bear on a problem, mentioned by Maslow, of the 
simultaneous existence of both types of symptoms in the 
same individual. 


F. L. Marcuse 


tionale developed above: It may be argued that the 
real roots of the problem were not revealed because 
of the use of inadequate techniques. This point 
basically assumes what is being questioned here, i.e., 
that all symptoms are really true symptoms and the 
use of adequate techniques would reveal this. The 
second criticism states that the case cited involves 
symptom substitution rather than symptom disap- 
pearance. With regard to this criticism, it has been 
mentioned that the substituted symptom (i.¢., chew- 
ing gum) may not have been necessary, but even 
if it were, a minimal claim might be made that 
symptom substitution directed by the therapist to 
a more socially acceptable practice is feasible. The 
final criticism states that not all people are suscepti- 
ble to hypnosis. While it is true that not all 
people are susceptible to hypnosis and that the 
depth of hypnosis required for therapeutic pur- 
poses is problematical, it should be remembered 
that hypnosis is only one technique that may be 
used for a frontal attack on a symptom. 

In summary, the argument is made that in cer- 
tain definable instances a symptom, rather than in- 
dicating an underlying conflict, may be autonomous 
in nature. This latter type of symptom may be 
identified by adequate prior testing and by analysis 
of the therapeutic results obtained. It is further 
suggested that when an autonomous symptom is 
present a direct frontal attack is indicated. It is 
finally argued, on both theoretical and empirical 
grounds, that there are two types of symptoms in 
the minor behavior disorders and that a differentia- 
tion between them should be made. 
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COMMENTS ON “THE J-CURVE REVISITED” 


RICHARD S. SOLOMON 
Dayton, Ohio 


nw the October, 1951 issue of this Journat, Pauline 
N. Pepinsky selected the J-curve hypothesis as 

an interpretive framework for her report on re- 
search involving interracial attitudes. This should 
be of special interest to social psychologists since 
the J-curve hypothesis has long been neglected as a 
valuable method of interpretation in the study of 
institutional behavior. Dr. Pepinsky has admir- 
ably demonstrated this in her study in which the 
J-curve hypothesis might ordinarily not be consid- 
ered applicable. 

There are, however, some oversights and perhaps 
unintentional mistakes which should be clarified. 
The J-curve hypothesis was originally developed by 
F. H. Allport some twenty years ago and elaborated 
on by a small group of his students, including the 
writer, It is one of those unique psychological 


hypotheses which demand a long association and 
experience to be handled with clarity and validity. 

In her article Dr. Pepinsky referred to criticism 
of the J-curve hypothesis by Katzoff and others, re- 
ferring specifically to (a) the limitations of applica- 
tion, (4) the inadequacy of the scaling methods, 


(c) the fact that conformity is arbitrarily defined, 
and (d) the results being subject to ambiguous 
interpretations. 

In reference to these concrete objections, I would 
like to point out the following considerations: 

a. The J-curve hypothesis was developed explicitly 
to be used in the study of behavior in conformity 
or institutionalized situations. Therefore, objection 
is not justified if the hypothesis cannot be appropri- 
ately applied to all types of situations. In the study 
of institutional behavior for the first time we have 
a technique which lends itself to the measurement 
of individual behavior, whereas formerly institu- 
tions were described in terms of mass media and 
nonexplicit ideas characteristic of the sociological 
approach. The hypothesis, therefore, can be ap- 
plied in the study of any social behavior, providing 
it can be observable as the behavior of individuals 
comprising or acting in the situation itself. 

b. This poses directly the problem of scaling. It 
is obvious that in the application of the scientific 
method measurement is essential; a scale based on 
an appropriate continuum is also essential. All 
studies involving the J-curve hypothesis have been 
based on this type of measurement in which an 
appropriate continuum is selected and a scale of 
values is appropriately applied on the continuum. 
This is a relatively simple matter when various con- 
tinua are known to be relevant and are very closely 


associated with an investigator’s experience or are 
commonly known. Such continua are time and 
length sequences, and scale values are empirically 
found in the form of minutes and feet respectively; 
on the Telic continuum such scale values in this 
respect are denoted, for example, as various “de- 
grees of lateness” or “degrees of deceleration.” 

In the study of other types of social behavior, 
however, very often the appropriate or relevant 
continua are unknown and have to be discovered, 
as in any other type of experimental work. This 
is true in the writer’s own study of conforming 
behavior in church, library, and clubroom situations 
where the common relevant continuum was found, 
after much study, to be “degrees of annoyance,” 
produced by conversation. It cannot therefore be 
considered as a criticism if appropriate continua 
are difficult to find. In relation to scaling methods 
themselves, in all important studies involving the 
J-curve hypothesis, the scaling methods were rigor- 
ously developed according to the basic psychophysi- 
cal methods originally developed by Thurstone and 
elaborated on by F. H. Allport. These scale values 
originally were admittedly crude, but in a later 
study by F. H. Allport and the writer, scale values 
were held to rigid specifications in that they were 
appropriate to the continuum, they occurred in the 
correct sequence, and the step intervals were equal- 
ized. In fact, in the writer's own study of annoy- 
ance mentioned above, a technique was developed 
for transferring the steps on a Telic continuum of 
annoyance to an empirical continuum of time 
measurable in minutes and seconds so that it was 
possible for the first time to measure a psychological 
event or feeling in terms of a time interval in an 
institutional setting. 

¢. Almost all critics of the J-curve hypothesis 
seem to take issue by stating that “conformity is 
arbitrarily defined.” This is not true. While a 
conformity situation is defined as existing only 
when 50 per cent or more of the cases fall on the 
first step of the scale, this condition was not an 
“arbitrary stipulation,” as Dr. Pepinsky states. This 
rule was discovered empirically after many various 
types of J-curve distributions were analyzed and it 
was found that those which most exactly fit the 
type of distribution described also had 50 per cent 
or more instances occurring on the first step of the 
scale. This is a statement of fact. However, in 
terms of logic itself it would be contradictory even 
to speak of conformity behavior or the presence of 
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an institutional situation if more than 50 per cent 
of the population involved were not behaving alike. 
d. This leads to the last point of consideration 
which seems to be especially misinterpreted by 
Katzoff as referred to by Dr. Pepinsky. This ap- 
plies to the use of statistical methods of interpreta- 
tion commonly associated with the normal or Gaus- 
sian type of distribution. (In corzespondence Dr. 
Pepinsky has stated that this objection after receiv- 
ing further information does not particularly ap- 
ply.) The possibility for ambiguity in interpretation 
from a theoretical basis is always present, not only 
in relation to psychological fields but universally. 
In order to reduce the possibility of ambiguity, the 
use of standard statistical procedures is suggested, 
which of course is not a unique thought. Specifi- 
cally, it was pointed out that if measures of skew- 
ness were applied they would furnish “an index of 
the degree of conformity present in a particular 
case.” Those who suggest this type of treatment 
have evidently not attempted it. It does not work. 
The reasons for this lie in the differences between 
the theoretical concepts involving the J-curve dis- 
tribution and the normal frequency distribution. 
Measures of skewness and also kurtosis were de- 
signed with particular reference to the Gaussian 
distribution. The normal frequency or Gaussian 
type of distribution is based on an infinite number 
of variables, all of which are not particularly re- 
lated. The J-curve distribution, on the other hand, 
is based on a finite or relevantly few number of 
variables, all of which are intermittently related. 
The four principal variables are thought to be (a) 
conformity-producing agencies which “push” the 
number of cases to the first step of the scale and 
thus account for the piling up of instances thereon; 
(4) simple chance, which operates not in the man- 
ner of compound probability as ir the normal dis- 
tributional curve, but as a leveling out factor, and 
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accounts for the horizontal trend of the J-curve (if 
simple chance operated alone in the absence of 
other factors, a straight-line distribution would be 
obtained horizontal or parallel to the base line); 
(c) physiological or biological properties; and (d) 
personality deviations which, together with physio- 
logical properties, accsunt for instances on the two 
extremes of the J-curve distribution. 

The essential concept in measurement of the J- 
curve distribution is the use of the Telic continuum; 
this is a continuum of purpose and without it the 
J-curve distribution cannot be appropriately referred 
to. Moreover, the scale of values on the Telic con- 
tinuum are arranged in step intervals and are not 
continuous points on the scale as in the empirical 
continuum which are capable of infinite division. 
The reason, of course, is related to the Weber- 
Fechner law so that there is a point below which 
further division is psychologically meaningless. 
Therefore such measures as kurtosis and skewness 
give us invalid results when applied to the J-curve 
distribution. 

As a matter of fact, in practical application the 
writer long ago discovered this in the fruitless or at 
least nonproductive negatively consumed man hours 
of computation. As a result the writer did develop 
a formula resulting in an “index of conformity” 
which was based on the theoretical assumptions 
underlying the J-curve hypothesis. This was pub- 
lished in Sociometry, Vol. II, 1939. 

In conclusion, it is our hope that Dr. Pepinsky’s 
article, as well as these comments, may stimulate 
further research and the application of the J-curve 
hypothesis to the study of institutional or conform- 
ing behavior. If it is understood, it can be a very 
valuable method and technique and can lead to 
fruitful interpretations which might otherwise be 
overlooked. 
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THE METHODOLOGY OF NOTCUTT AND SILVA’S “KNOWLEDGE 
OF OTHER PEOPLE”: A CRITIQUE 


EDWIN COHEN 
The University of Oklahoma 


T is indeed unfortunate when an _ otherwise 
I valuable experiment, such as that reported by 
Notcutt and Silva (1) is spoiled by errors in statisti- 
cal design and inference. At least four unjustifiable 
steps appear in their procedure. 

1. They state, “In the absence of special informa- 
tion, we assume all self-ratings to be equally prob- 
able” (1, p. 33). This “special information” could 
easily be obtained by constructing a simple fre- 
quency distribution of the self-ratings actually ob- 
tained. The distribution of predictions could be 
obtained in like manner. These empirical distribu- 
tions, if bell-shaped, as might be expected, rather 
than rectilinear, as tacitly assumed, would yield 
smaller theoretical errors than they obtain. Inas- 


much as their “demonstration” of insight rests upon 
the actual error being less than the theoretical error, 
their probable inflation of the theoretical error (by 
their assumption of rectilinearity) invalidates their 


inference. 

2. In their example, in Table 2, showing evalua- 
tion of one judge's errors of prediction, they use 
chance errors based upon the mean errors attached 


to each self-rating, from Table 1. Aside from the 
invalidity of their chance errors, as already men- 
tioned, there is no a priori reason for using self- 
rating, rather than prediction, in evaluating the 
chance errors. 

3. Another broken link in their chain of infer- 
ence is their uncritical use of the ¢ test. The ¢ test 
is not applicable inasmuch as its use requires nor- 
mal distributions, and it is evident from Fig. 2 that 
the predictions are not normally distributed. 

4. Although not crucial to their arguments, their 
omission of cases of insight inferior to chance (1, 
Pp. 35) appears very questionable. There seems to 
be no reason to believe that predictions of certain 
people may not be consistently worse than chance. 
Just as the presumably positive batting averages of 
clinicians may be an index of competence, so might 
a negative batting average be an index of malad- 
justment. 
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Soctat Psycuotocy. By Robert E. L. Faris. New 
York: The Ronald Press Co., 1952. Pp. vii 
+ 420. Price $5.00. 


One of the most eloquent testimonies to the 
fact that social psychology is an amorphous, ill 
defined field is the variety of legitimate points of 
view from which a textbook on social psychology 
may be written. Professor Faris’ text is written 
from the standpoint of a sociologist steeped in the 
tradition of G. H. Mead and the author’s eminent 
father Ellsworth Faris. The principles of individual 
psychology are held to be insufficient for the under- 
standing of social behavior, and Professor Faris con- 
siders society to be more than the sum of its com- 
ponent individuals. It is the influence of society 
on the developing individual that accounts for man’s 
characteristic adult personality. Both behaviorism 


and psychoanalysis are rejected by the author as 
being either insufficient or unnecessary for the 
explanation of the adult personality. While the 
psychological data and theories invoked are, in 
general, eclectic, Professor Faris’ view of personality 
is most akin to that of Gardner Murphy. 

Faris emphasizes socially conditioned man as 


the final product of onto- and phylogenesis; he 
dwells, therefore, at length on those aspects of 
society that influence man’s adult personality, and 
on certain concepts of personality which are seldom 
treated in detail by psychological social psycholo- 
gists, i.¢., consciousness and self-consciousness. Be- 
cause of this orientation, he does an unusually 
good job of describing the process of socialization 
in later childhood and adulthood, as it is influenced 
by social factors outside the family. As a result 
of his rejection of psychoanalytic theory, however, 
there is a proportionate lack of emphasis on early 
childhood socialization. Since the error is usually 
made in the other direction, Faris’ emphasis is 
refreshing and instructive, but is no more balanced 
than the opposite form of overstatement. 

The reviewer would feel more at ease if there 
were some few invariant criteria by which to judge 
textbooks in social psychology. Inevitably the 
theoretical prejudices and research interests of the 
reviewer come into play in such an evaluation. 
Since I find I am unable to label myself with 
any neat set of theoretical prejudices and research 
interests, I am incapable of doing the honorable 
thing of making these explicit. Nevertheless, I 
find a few lacks in this book, and the reader may 
make such discount as seems warranted. 

1. There is a generally anachronistic tone to 


much of the author’s argument. To some extent, 
I had a feeling that he was often fighting over 
issues that had been settled a decade ago. In his 
historical review, for example, Faris concentrates on 
the limitations of McDougall’s instinct theories and 
makes no mention of the very valuable contributions 
of McDougall to a more dynamic approach to 
social psychology, especially to the modern ap 
proaches to perception and attitude formation. 
Furthermore, rather than argue that “instinct” is 
not a sufficient construct for the explanation of 
social behavior, the author might have done better 
to devote equal attention to explicating the role 
which biological factors play in the development of 
motives. 

2. While there is much to applaud in the chap- 
ters on “The Neurotic Role” and “Personality Dis- 
organization,” it is peculiar indeed to find so little 
reference to the effect of early childhood experience 
on character formation. It is one of Faris’ very real 
merits that he spells out in such detail the stress- 
producing effects of situational factors which are 
largely socially determined. It is a defect, however, 
to ignore the extent to which early experience pre- 
disposes one to react differently to such stresses. 

3. Certain traditional areas of social psychology 
(group prejudice, national character, industrial con- 
flict, attitude measurement and survey research) are 
either missing or treated in a summary fashion. 
Perhaps such topical areas have no place in a 
systematic work—and this text is more systematic 
than most. Nevertheless, most social psychologists 
will find this a defect in using the book for 
teaching purposes. 

To some extent these defects are obviously but 
the reverse sides of the book’s merits. Because 
of the immense range of topics and viewpoints 
which social psychology embraces, it is impossible to 
write a textbook which meets all needs. Most 
psychologists will probably find the emphasis too 
heavily sociological for their interests and, therefore, 
may not care to use it as a text for the intro 
ductory course in social psychology. To the extent 
that this is true, however, Faris’ book is to be 
heartily recommended as a supplementary text. 
The latter half of the book fills in gaps which 
characterize uniformly the work of individually 
oriented social psychologists. 

The book is well written and organized. It is 
a scholarly and competent job, and eminently suited 
to undergraduate teaching. 

Raymonp A. Bauer 

Harvard University. 
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Tue Art or Askinc Questions By Stanley L. 
Payne. Princeton: Princeton University Press, 
1951. Price $3.75. 


This book is largely a summary of Mr. Payne's 
wide experience in practicing the art of question 
wording. It is written primarily for the researcher 
who uses or is considering using the questionnaire 
and the interview technique to collect data on 
public opinion. A few quotations from the preface 
will indicate the purpose and scope of the book. 
“The author is just a general practitioner in re- 
search. Having made more than my share of 
mistakes in appraising issues for public consumption, 
and feeling the need for a book on the subject, I 
found it necessary to write it myself. . . . This 
book is very limited in its subject matter. It dis- 
cusses the wording of single questions almost ex- 
clusively. It hardly touches upon problems of 
question sequence or the overall manner of ques- 
tionnaire design. .. . As it stands, this book con- 
sists of some observations of human behavior, a 
few principles of wording, many exceptions of 
these principles, several unexplained oddities and 
numerous unexplained dilemmas. It is undoubtedly 
richer on the how-not-to side, than on the how-to 
side. For want of a better description, it might 
be thought of as a collection of possible considera- 
tions for question wording” (p. xi). 

As these statements suggest, the book is a com- 


pilation of the present state of experience and 


knowledge in the art of asking questions. It is 
wisdom based on a good deal of experience in 
problems of wording many questions. It is based 
very little on research findings. It is a compilation 
of the “do’s” and “don'ts” of question wording, 
particularly from the semantic aspects of questions. 
Each point which is made is thoroughly illustrated 
from the author’s experience in question wording 
and from existing studies in public opinion and 
market research fields. 

“For the student or the neophyte in question 
wording, this book will be of considerable as- 
sistance in avoiding many of the semantic problems 
of question wording. This is by far the most 
comprehensive work in the field of which this 
reviewer is aware. The student who is given 
this book to read is in for a pleasant shock since 
the writing is in such “non-textbookish” style 
that it can be read almost like a novel. The author's 
ready wit and careful selection of illustrations make 
the book very readable. 

One criticsim of the book is that questions are 
discussed as though they were the end products of 
the research rather than in terms of the research 
objectives of the question. An example is found in 
the discussion of the open or free-response question. 
The reader will conclude that in some ways the 
open question is good and in other ways it is 
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bad. This is, of course, correct; however, the 
criteria for deciding the type of question to be 
used should be based primarily on the research 
objectives. The reader may miss the fundamental 
point that a question must be evaluated from two 
points of view. First, whether it covers the research 
objective for which it was designed; second, whether 
it accurately communicates the research objects to 
the respondent. 

As the author states, he does not include dis- 
cussions on question sequence, over-all design of 
the questionnaire, or the use of multiple questions 
to meet a single objective. It is unfortunate that 
these topics are omitted, since often the solution to 
the lack of the “perfect question” is the use of 
several interlocking questions. It is, in fact, seldom 
that an issue worth studying can be covered by 
one question, no matter how well designed. Thus, 
in major part the problems of question wording 
cannot be discussed adequately without considering 
the interrelationships and sequence of the questions. 

The author spends some time discussing the 
language level and sentence complexity appropriate 
to the general population. He states that the goal 
is to word questions to the lowest educational level. 
It should be clear that such an admonition is appro- 
priate only when one is interviewing the general 
population. The level of language for special 
groups (physicians, factory managers, college grad- 
uate students, etc.) would be quite different. The 
generalization might better be stated to make the 
language level appropriate to the group which is 
being studied. 

To sum up, this book is an excellent guide to 
the researcher who is about to undertake a cross- 
section sample study using poll-type questions. It 
will be of less use to the social scientist who is 
designing a questionnaire to investigate complex 
social issues. 

Cuarres F. CaAnNELL 

University of Michigan. 


PsycHoaNALysis 4ND Group Benavior. By Saul 
Scheidlinger. New York: W. W. Norton, 1952. 
Pp. xviii+245. $3.75. 

Scheidlinger’s motivation for writing this book 
appears to have stemmed from a convictien that 
Freud's Group Psychology and the Analysis of 
the Ego has been seriously neglected by educators, 
group workers, social psychologists, and even group 
therapists. To help remedy the resulting blind- 
ness among these professions he has written a 
book made up of two major parts. The first con- 
sists of 156 pages of exposition, summary, and a 
rather feeble critique of the major Freudian con- 
cepts of group relations. The second attempts to 
show, in 68 pages, the implications of the Freudian 
concepts for other fields of group study and for 
practice in education and group therapy. 
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How good a remedy is this book? Having read 
it, will a person dealing with groups possess better 
vision? The answers to these questions must 
depend primarily upon the adequacy of Scheid- 
linger’s diagnosis. Certainly for anyone who has 
not read Freud's classic volume and for those 
unfamiliar with such publications 2s Redl’s impor- 
tant papers on group emotion, leadership, and 
contagion, this book will introduce significant work. 
If the psychoanalytic approach to groups has been 
neglected through unfamiliarity with the field, here 
is a good introduction to most, though not all, of 
it. If, on the other hand, neglect has been due 
not to unfamiliarity but rather to a failure to be 
convinced, then I doubt that this book will have 
a very marked influence. 

Let us consider first how adequately the psycho- 
analytic approach to groups is presented. As sug- 
gested above, Scheidlinger is mainly interested in 
expounding the views of Freud, himself. Subse- 
quent psychoanalytic writers are presented primarily 
to elaborate upon points made by Freud or to 
indicate that a lot of people agree with Freud. 
Greatest reliance is placed upon the writings of 
Fenichel, Anna Freud, Hartmann, Kris, Redl, and 
Slavson. Conspicuous by their absence are Bion 
and the group of therapists drawing upon the 
work of Melanie Klein, such as Ezriel and 
Sutherland. Apart from this serious deficiency, 
the book serves as an excellent summary of the 
major components of psychoanalytic theory about 
groups. Of especial value are Chapter 4, on leader- 
ship, and Chapter 6, which presents in 15 pages 
a remarkably concise statement of the theory. 
This latter chapter should find its way onto all 
bibliographies of “approaches to the theory of 
groups.” 

As a document to convince informed skeptics, 
this volume has several fundamental weaknesses. 
Here are a few of the more salient ones. 

The book rests its case basically upon plausibility 
and authority rather than empirical evidence open 
to independent verification. Time after time the 
author asserts that “So-and-so showed.” How 
Mr. So-and-so made his demonstration is never 
explained. Nor can one tell whether the observa- 
tions were based upon a single group or a popu- 
lation of groups, whether the groups were organized 
for therapy or other purposes, what their composi- 
tion was, whether they were a part of a larger 
social system, and the like. A skeptic can cease 
doubting only if the assertions seem reasonable or 
if he is willing to believe that the original investi- 
gator is to be safely taken on faith. In fairness 
to Scheidlinger, it should be noted that he explicitly 
notes this difficulty, but it does not appear that he 
took his own criticism seriously enough. 

Most skeptics of psychoanalytic group theory will 


themselves hold to other theoretical positions. 
Scheidlinger recognizes this competition by includ- 
ing a chapter devoted to a comparison between 
Freudian concepts and “other fields of group study.” 
These other fields are “sociology and social psy- 
chology,” “sociometry,” “field theory,” and “social 
group work.” He disposes of each in a minimum 
of pages. Sociology and social psychology are 
blamed for ignoring Freud or for misrepresenting 
him. This accusation is bolstered by quoting a 
few prestigeful social scientists who have seen the 
light and have chided their colleagues for their 
narrow views. Sociometry is criticized primarily 
because the sociometric choice does not fully reflect 
unconscious motivations and because it has been 
the author’s experience that grouping on the basis 
of such choices often leads to difficulties in the 
group. Scheidlinger is more ambivalent about 
field theory. Unfortunately the most recent work 
with this orientation that he cites is the study on 
social climates conducted by Lewin, Lippitt, and 
White in 1939-40. He relies heavily upon criti- 
cisms and evaluations made at the time Lewin was 
first attracting attention in this country (for 
example, Tolman’s paper on “Lewin’s Concept of 
Vectors,” 1932, and Brown’s book, Psychology and 
the Social Order, 1936). His comments on Lewin’s 
beliefs about ahistorical causation show no evi- 
dence that he has read Lewin’s major writing on 
the topic. On the other hand, he appears to favor 
a reconciliation of field theory and psychoanalytic 
concepts in which field theory “would contribute 
its highly useful group dynamic concepts and 
methodological experience, while Freudians would 
supply their wealth of data in the area of uncon- 
scious motivations and mechanisms, as well as 
genetic propositions” (p. 177). As one field theorist, 
I would welcome such a reconciliation and am 
pleased to know that several projects are currently 
working toward this goal. The treatment given 
to social group work consists essentially of a book 
review of a recent text in the field and a state- 
ment of hope that group workers will also con- 
tribute to work in this field. 

Scheidlinger’s own experience and original 
thinking are reflected most fully in the two final 
chapters devoted to educational practice and group 
psychotherapy. But here, too, he fails to impress 
the skeptics of psychoanalysis. It is perhaps signifi- 
cant that his own cathexis is clearest in the chapter 
on educational practice and that this chapter appears 
to have the least relation to psychoanalytic con- 
cepts. He notes that educators have been influenced 
by Freudian thinking, and not always to the 
advantage of the students since certain features of 
Freud's thinking were usually emphasized at the 
expense of others. He urges teachers to become 
more aware of the class as a group. He presents 
an interesting analysis of the various roles the 
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teacher may take in a class. And he presents 
some case material showing how a group worker 
can help teachers do a better job. This chapter is 
both original and helpful, but it does little to 
persuade the skeptic of the importance of Freudian 
concepts. 

The final chapter on group psychotherapy briefly 
describes some of the thinking in this field. It 
suggests that there are some controversies among 
various therapists, but the analysis and exposition 
do not go deeply enough to elucidate clearly the 
fundamental issues. Rather, the author escapes 
into case material to “illustrate” how certain group 
dynamics aspects work in therapy groups. 

A final word of evaluation. If one compares 
this volume with recent publications devoted to 
theories of groups, one is impressed by the fact 
that competing theories often don’t really compete 
at all. Various theories are often in fact not differ- 
ent theories about the same phenomena but simply 
different theories about different phenomena. Few 
would disagree that ultimately we should have a 
single integrated theory of all group phenomena. 
However, in view of the staggering complexity of 
group phenomena it may well be strategically sound 
to devote our immediate efforts toward the con- 
ceptual clarification and empirical testing of the 
various complementary theories rather than try to 
build all-inclusive “master theories.” 1 would sup- 
port this position by reference to Homans’ recent 
book on The Human Group in which he attempts 
to develop a broad and integrated theory. In this 
he employs the concept “sentiment” as an “element 
of social behavior.” Now it is clear that Scheid- 
linger’s whole book is about this alleged “element,” 
and Scheidiinger can certainly criticize Homans 
for oversimplifying and ignoring much about con- 
scious and unconscious motivation. While it is 
true that Homans, too, can legitimately criticize 
Scheidlinger for narrowness in ignoring his other 
“elements,” complete coverage should not be pur- 
chased at the cost of oversimplification and 
superficiality. 

Dorwin CARTWRIGHT 

University of Michigan. 


Famity CENTERED MATERNITY AND INFANT Care. 
Edited by Edith B. Jackson and Genevieve 
Trainham. New York: Josiah Macy, Jr. 
Foundation, 1950. Pp. 29. $0.25. 

ProsLems oF INFANCY AND CuiLpHoop. Edited by 
Milton J. E. Senn. New York: Josiah Macy, 
Jr. Foundation, 1951. Pp. 181. $2.25. 


SymposiuM ON THE Heattuy Personatity. Edited 
by Milton J. E. Senn. New York: Josiah 
Macy, Jr. Foundation, 1951. Pp. 298. $2.50. 


For several years the Macy Foundation has been 
sponsoring conferences on problems of infancy 
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and childhood. The present reports come from the 
fourth conference and from. two special meetings 
held for the purpose of preparing material on 
healthful development to be submitted to the 
Midcentury White House Conference in December, 
1950. The pamphlet by Jackson and Trainham is 
a brief and businesslike report on the objectives, 
development, and present distribution of the room- 
ing-in plan in maternity wards. The two volumes 
edited by Senn each contain three prepared papers 
and also the verbatim spontaneous comments con- 
tributed by the members of the conference. The 
Macy Foundation is almost infinitely hospitable 
to discussion; the two books together contain no 
less than 273 pages of it. ad 

Problems of Infancy and Childhood opens with 
a paper by George Peter Murdoch and John W. 
M. Whiting entitled “Cultural Determination of 
Parental Attitudes: The Relationship between the 
Social Structure, Particularly Family Structure, and 
Parent Behavior.” This anthropological report is 
followed by a contributica from animal psychology: 
T. C. Schneirla gives “A Consideration of Some 
Problems in the Ontogeny of Family Life and 
Social Adjustments in Various Infrahuman Ani- 
mals.” Both reports tell of research in progress 
as well as research already accomplished. In the 
final paper, entitled “Working Toward Healthy 
Personality,” Lawrence K. Frank surveys broadly 
the problems involved in summarizing what we 
know about healthy development and in present- 
ing it in a useful form to parents, teachers, and 
guidance workers. 

Frank’s theme is pursued in the Symposium on 
the Healthy Personality. Erik H. Erikson opens 
with a paper on “Growth and Crises of the 
‘Healthy Personality,’” developing and clarifying 
some of the ideas, especially that of ego identity, 
set forth in his recent Childhood and Society. Then 
comes an information-packed survey by M. F. 
Ashley Montagu of “Constitutional and Prenatal 
Factors in Infant and Child Health.” In the 
closing paper, “Toward a Social Psychology of 
Mental Health,” Marie Jahoda undertakes a defini- 
tion of mental health and suggests various direc- 
tions for research on the ways in which health is 
influenced by environmental factors. 

Readers interested in the central themes of these 
publications can readily grasp them by reading 
merely the prepared papers, which are of high 
quality. Those who are trying to develop their 
own thinking on any of the topics will find it 
worth while to dredge their way through the 
discussions, where they will find a rewarding 
number of edible oysters, some of which even 
contain pearls, 

Rosert W. Wuire 


Harvard University. 
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